How We Do Storage: A Saga of Bits, Disk, Tape and Systems

The following is a guest post by Trevor Owens, Digital Archivist with the Office of Strategic Initiatives.

Over the last six months, the National Digital Stewardship Alliance Infrastructure Working Group has been talking storage. After listening to several presentations on shiny new decentralized, distributed and or cloud storage platforms we quickly realized that the diversity of our members experience meant that we were thinking around each other a fair bit. At that point we decided it would make a lot of sense for the group to start by documenting NDSA member’s approaches to large scale storage for digital preservation and stewardship.

Western Digital WD, by williamhook, on Flickr

Ultimately, we intend to share the results of this project with other organizations looking for guidance about preservation and stewardship storage. We are working to pin down what our collective experience suggests are key elements to consider when planning, implementing and maintaining these systems. In the process, we hope to articulate some of the principles that guide the members approaches to storage systems and architectures.

At the Make it Work conference, the infrastructure group was happy to host Tales from the Crypt: How are we meeting the challenges of large scale storage?, a workshop focused on refining the preliminary results of our work on storage. During the workshop, we shared the responses of an open ended questionnaire from eight members, who represent the diversity of the NDSA membership.  During the workshop, we broke into groups to discuss how the responses from these members resonated with the workshop participants experiences with large scale storage. Discussions and commentary from the different groups was quite lively.  At one point, there were 35 different individuals collaboratively editing the document!

Some areas of discussion included:

óŹ       Member approaches to disk and tape

óŹ       Simple file systems and the value of control

óŹ       The “when” and “why” of format and system migration

óŹ       Member approaches to vendor systems

óŹ       Current and upcoming critical characteristics of storage systems for digital preservation

The History of Tape Storage, by Pargon, on Flickr

Since the workshop, the infrastructure group has been refining this document by adding the comments from workshop participants.  Ultimately, we intend to broadly share the finalized document, particularly with institutions looking to establish their own large scale storage infrastructure for digital stewardship.  Also, we imagine that the resulting document will be of interest to vendors that want to know more about what drives institutional approaches to large scale storage.

Before finalizing this document on the NDSA member’s approaches,  the working group plans on a few substantive additions to the current draft.  Specifically:

óŹ       Creation of a matrix laying out trade offs in different member’s approaches: There was general interest in creating a matrix from some of the documents key points, laying out the pros and cons with different choices and approaches to preservation and stewardship storage. The group plans to draft and integrate a matrix like this into the document.

óŹ       Short targeted survey of the membership: There was general consensus that a targeted survey of the NDSA membership focused on generating some descriptive statistics to back up or complicate some of the claims we are making in the document would be useful. We set a target to launch the survey in November.

 

óŹ       Addition of a brief section focused on definitions and terms: The diversity of the group’s membership reinforces the need for some tight working definitions to facilitate common understanding. For example, something as fundamental as the overlapping definitions for “storage system”, “preservation system”, “storage architecture”, “storage management system”, and “IT infrastructure”. Working group members are also creating a short addition for the document that tightens up our use of terminology.

The group will share, and invite further discussion and comment on, this work in progress at the next “Designing Storage Architectures for Preservation Collections” meeting this September. After that, and some more work from the working group, we will more broadly distribute the resulting report. You can expect to hear more about this project, and any report that comes out of it, here on this blog.  Any NDSA member interested in being involved in this initiative can contact the infrastructure working group co-chairs. If your organization is not currently an NDSA member, please consider joining the Alliance.

Add a Comment

This blog is governed by the general rules of respectful civil discourse. You are fully responsible for everything that you post. The content of all comments is released into the public domain unless clearly stated otherwise. The Library of Congress does not control the content posted. Nevertheless, the Library of Congress may monitor any user-generated content as it chooses and reserves the right to remove content for any reason whatever, without consent. Gratuitous links to sites are viewed as spam and may result in removed comments. We further reserve the right, in our sole discretion, to remove a user's privilege to post content on the Library site. Read our Comment and Posting Policy.

Required fields are indicated with an * asterisk.