Digital Disaster Planning: Get the Picture Before Losing the Picture

The following is a guest post by Chelcie Rowell, 2012 Junior Fellow.

Frequency of occurring? Rare. Impact of occurring? Huge. I’m talking about digital disasters.

Zine Symposium - Russell Square London 2006, by szczel, on Flickr

Zine Symposium - Russell Square London 2006, by szczel, on Flickr

Stewards of digital content, like stewards of analog content, must plan for catastrophe in advance in order to minimize loss and recover quickly. True, digital disasters may occur infrequently. But at the scale that institutions collect digital content and for the length of time that institutions wish to preserve digital content the risk of a disaster is non-trival.

Disasters may be natural (such as tornadoes and earthquakes) or failures of infrastructure (such as  power failures). Disasters may result from intentional human action (such as cyberterrorism) or simply human error (such as accidental deletion).

A digital disaster negatively impacts an institution’s digital content. What distinguishes many catastrophes that threaten digital content from those that threaten analog content is that digital disasters may be much less visible.  Bit rot is a one-in-a-million occurrence, for example, but when it happens special tools are needed to seek it out and prevent a digital disaster.

At a recent digital disaster planning workshop, Jessica Branco Colati walked participants through the process of preparing for and recovering from catastrophe. The importance of this is highlighted by two recent headlines that provide concrete examples of the stakes involved with disaster planning.

When the website for avant-garde 3:AM Magazine suddenly disappeared, what staff initially believed to be a glitch quickly turned into deeper concern when the service provider responsible for managing the site’s servers was unable to be reached.  Editor Andrew Gallix was quoted as saying “I never expected those who were meant to host and back up our content to just switch us off without even telling us.”  The extent of the digital disaster was difficult to assess due to crucial failures of communication. Unable to contact their service provider, staff felt powerless to take any action to recover their content. Referring to the missing service provider, Gallix said, “At this stage, we do not know if we’ll every be able to speak to him and if he can switch his server back on long enough to allow us to move 12 years’ worth of content to another, more reliable host.”

Lego Woody, by jamiejohndavies, on Flickr

Lego Woody, by jamiejohndavies, on Flickr

Pixar faced a digital disaster of comparably catastrophic impact involving the film Toy Story 2As described by a Pixar technical editor, an accidental deletion wiped the working files before the film was finished.  What audiences experience as an animated film is actually a complex digital object that contains thousands upon thousands of smaller files. Combined, these files are rendered into frames—including animation, set, and lighting data—that sequentially make up the moving image.

As the accidental deletion unfolded, pieces of that complex digital object were removed from disk, seemingly before the makers’ very eyes. As Oren Jacob, the film’s assistant technical director, put it “Woody’s hat disappeared. And then his boots disappeared. And then as we kept checking, he disappeared entirely. Woody’s gone.”

Fortunately, the studio was able to quickly restore the film from backups. But after the backups were revealed to be corrupt, the only recourse was to inventory different versions and perform human-intensive quality review to stitch together enough valid data to render a relatively complete film. Jacob recalled, “In the end, human eyes scanned, read, understood, looked for weirdness, and made a decision on something like 30,00 files that weekend.”

Both these episodes raise the issue of risk tolerance.  When an institution manages unique digital materials, it needs to seriously consider what steps have to be taken to prevent–or at least minimize–loss.

This post was edited on 7/20/2012 to correct typographical errors.

4 Comments

  1. h macfarland
    July 19, 2012 at 5:03 pm

    Digital corruption may begin before posting: This text contains two elementary errors–can you find them?

  2. Willie Jones
    July 20, 2012 at 10:38 am

    I totally agree with the importance and validity of this content and would like to use it in some of my training materials as I present Disaster Preparedness to other UM staff.

    Request to Use:
    I will retain all of the Post Information here that identifies the writer of this article.

    Thanks in advance,
    Willie Jones, UM Records Analyst

  3. Bill LeFurgy
    July 20, 2012 at 10:47 am

    Typos fixed.

  4. Chelcie Rowell
    July 20, 2012 at 11:09 am

    Willie, I’m happy that you find these examples useful! The disaster planning workshop led by Jessica Branco Colati of the Northeast Document Conservation Center provided an excellent framework for thinking about digital disasters, which enriched my understanding of these headlines I encountered on the Web. Another aspect of the workshop that I appreciated was the emphasis on making realistic decisions based on the institution’s available resources.

Add a Comment

This blog is governed by the general rules of respectful civil discourse. You are fully responsible for everything that you post. The content of all comments is released into the public domain unless clearly stated otherwise. The Library of Congress does not control the content posted. Nevertheless, the Library of Congress may monitor any user-generated content as it chooses and reserves the right to remove content for any reason whatever, without consent. Gratuitous links to sites are viewed as spam and may result in removed comments. We further reserve the right, in our sole discretion, to remove a user's privilege to post content on the Library site. Read our Comment and Posting Policy.

Required fields are indicated with an * asterisk.