Lowering barriers to using collections in an NDSR workshop with Shawn Averkamp

This is a guest post by Charlotte Kostelic, National Digital Stewardship Resident with the Library of Congress and Royal Collection Trust for the Georgian Papers Programme. Her project focuses on exploring ways to optimize access and use among related digital collections held at separate institutions. This work has included a comparative analysis of international metadata standards and a series of user interviews in order to determine how current practices meet user needs. A final report on this project will be released at the end of December.

On November 13th the Library of Congress hosted Computation in Conversation: Fostering New Fluencies in Collections as Data, a lecture and workshop led by Shawn Averkamp, Manager of Metadata Services at New York Public Library. I had the opportunity to host the event as an enrichment session for the National Digital Stewardship Residency (NDSR) program, and I wanted to invite someone who would highlight the ways in which collections as data can be accessible for all library users regardless of their level of technical expertise. Within NDSR, I am working on a project that focuses on user needs for digital collections for research. In the context of my project, research could mean the work an elementary school student does for a class project or the work an academic does in preparing a journal article. What these and all other users have in common is a need for data to be made accessible in formats that they can use and understand.

Averkamp’s presentation focused on how librarians in every department of a library – from public services and outreach to cataloging and systems – can contribute their unique expertise to the development of collections as data. The presentation also explored how librarians can present collections as data in a way that lowers the barrier to use. In preparing for the workshop and presentation, Averkamp met with librarians across NYPL to ask them about the types of data users request and the ways in which they use the data sets NYPL has made public. Institutions such as the Library of Congress and NYPL have made significant amounts of data accessible to users. However, Averkamp highlighted that users may not know what to do with the data or may think that computational work is not for them.

Venn diagram describing "collections as data" as the overlap of people who can code and people who care about collections

Slide from Shawn Averkamp’s “Computation in Conversation” workshop on 13 November. Photo by Meghan Ferriter.

While the term computational use might suggest that one needs to be able to write code, it could also mean working with data in a spreadsheet. The barriers to using computational methods with collections as data can be even higher when the labor that goes into transforming data sets into visualizations or other digital projects is not made visible in the end product. Averkamp’s workshop aimed to lower this barrier by presenting simple, free tools that can be used to make a data set ready for computational use. Using just Google Sheets and Timeline.js, workshop attendees were able to standardize dates and load the data into a template so that individual objects from collections held by the Library of Congress and NYPL could be presented in a timeline. Anyone can view Averkamp’s slides or try the workshop on their own by following her guide here: https://github.com/saverkamp/loc-talk-2017.
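The workshop did this date clean-up in Google Sheets. For readers who prefer a script, the same idea can be sketched in Python. This is only an illustration, not Averkamp's method: the function name `split_date`, the input formats it handles, and the `Year`/`Month`/`Day` keys (chosen to resemble the date columns of a timeline spreadsheet) are all assumptions.

```python
import re

# Assumed input shapes: "1848-07-19", "1848-07", "[ca. 1869]", "1869?", "1835-1880".
_ISO = re.compile(r"^(\d{4})-(\d{1,2})(?:-(\d{1,2}))?$")
_YEAR = re.compile(r"(\d{4})")


def split_date(raw: str) -> dict:
    """Split a catalog-style date string into Year, Month and Day parts.

    Parts that cannot be recovered are returned as None. Month and day
    values are not range-checked in this sketch.
    """
    # Drop surrounding brackets, full stops and question marks.
    text = raw.strip().strip("[].?")
    # Drop a leading "circa" marker such as "ca." or "c.".
    text = re.sub(r"^(ca\.|c\.|circa)\s*", "", text, flags=re.IGNORECASE)

    match = _ISO.match(text)
    if match:
        year, month, day = match.groups()
        return {
            "Year": int(year),
            "Month": int(month),
            "Day": int(day) if day else None,
        }

    # Fall back to the first four-digit run, e.g. the start of a year range.
    match = _YEAR.search(text)
    if match:
        return {"Year": int(match.group(1)), "Month": None, "Day": None}

    return {"Year": None, "Month": None, "Day": None}


if __name__ == "__main__":
    for value in ["1848-07-19", "[ca. 1869]", "1835-1880", "undated"]:
        print(value, split_date(value))
```

Note that a range such as "1835-1880" is reduced to its starting year and "ca." is silently discarded, which is exactly the kind of context that gets lost when a data set is reshaped for a tool.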


View of interactive timeline demonstrating titles of publications relating to women's suffrage from 1835-1880, created during Averkamp's workshop.

Example Timeline.js using titles related to Women’s Suffrage at the Library of Congress and the New York Public Library

By presenting each of the steps that it takes to transform a data set into a timeline, Averkamp also highlighted how context can be lost with each change made to the data set. She discussed this loss of context in relation to Caroline Sinders’ concept of the data ethnographer. Sinders highlights the necessity of data ethnographers who will be able to describe the social and cultural contexts in which a data set was created. For library collections as data this could mean publishing a library’s cataloging guidelines, the date the data was created, and the transformations that were made to the data before it was made publicly available. Averkamp’s workshop demonstrated that by documenting the context of their data sets’ creation, as well as providing simple tools for using collections as data, librarians and libraries can lower the barrier to using collections as data.

You can follow Charlotte Kostelic and Shawn Averkamp on Twitter and find Averkamp’s workshop notes on GitHub.
