F is for Forensics

This post is part of a continuing series of alphabetically titled digital preservation topics.

A few months ago, I met a special collections librarian at a conference. I asked if her library was receiving digital materials in its acquisitions. She said, “Yes, but we are not doing anything with them at this time.” I suggested that she begin to follow the work of several efforts in the area of digital preservation forensics. If preservation of these collections is a goal and the collections are appraised and accessioned, then considerable work will need to be done to make them useful for the long term.

This highlights a digital preservation challenge particular to manuscript and other special collections. If the author, scientist, public official, artist, or academic is notable and of interest for special collections, there is an increasing likelihood that these “papers” collections will be a mixture of digital and paper. If the appraisal of these collections does not include some inventory of the hardware and software the donor used to create the documents, then processing the files and materials and making them useful into the future will be a very intensive investigative task.

Think about receiving a box of cryptically labeled floppy disks. Where do you begin? Will you have a device that can read them? Will you be able to understand the data on them? Should you copy them to another medium? Should you migrate the files to a newer format? Take heart: there is good experience to learn from.

Photo by Null Value on Flickr

Forensic preservation requires specific expert knowledge and tools to analyze digital media and files, and a growing body of expertise is being developed in this area. An extensive discussion of this emerging preservation strategy can be found in “Digital Forensics and Born-Digital Content in Cultural Heritage Collections.” The practice borrows tools and approaches from law enforcement and computer science. It facilitates the analysis of files and media for format, provenance and authenticity. The information derived from the analysis can then be used to migrate or transform the content for future re-use. It can also be used to document the history of the data's creation and subsequent changes.
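To make the idea of analyzing a file for format and authenticity a little more concrete, here is a minimal sketch in Python. It is not taken from any of the projects mentioned in this post. The function names (`guess_format`, `describe_bytes`, `describe_file`) and the tiny signature table are assumptions made purely for illustration, and real forensic tools rely on far larger format registries and on write-protected disk imaging. The sketch records a SHA-256 checksum, which can later show whether the bytes have changed, and makes a rough format guess from the first few bytes of the file.

```python
import hashlib

# Illustrative and deliberately tiny signature table (an assumption for this
# sketch); real format-identification tools use much larger registries.
MAGIC_SIGNATURES = {
    b"%PDF-": "PDF document",
    b"\x89PNG\r\n\x1a\n": "PNG image",
    b"\xff\xd8\xff": "JPEG image",
    b"PK\x03\x04": "ZIP container (also used by DOCX, ODT, EPUB)",
    b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1": "OLE2 compound file (legacy Microsoft Office)",
}


def guess_format(header: bytes) -> str:
    """Return a rough format label based on the leading bytes of a file."""
    for signature, label in MAGIC_SIGNATURES.items():
        if header.startswith(signature):
            return label
    return "unknown"


def describe_bytes(data: bytes) -> dict:
    """Summarize a byte string: size, SHA-256 checksum, and format guess."""
    return {
        "size_bytes": len(data),
        "sha256": hashlib.sha256(data).hexdigest(),
        "format_guess": guess_format(data[:16]),
    }


def describe_file(path: str) -> dict:
    """Read a file in binary mode (never modifying it) and summarize it."""
    with open(path, "rb") as handle:
        return describe_bytes(handle.read())
```

Running `describe_file` on a working copy of each file recovered from a disk, and saving the results alongside the files, gives a simple baseline record. If a later run produces a different checksum for the same file, something has altered it in the meantime.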

Analytic tools are also being developed and deployed.  My colleague, Leslie Johnston, wrote a blog post earlier this year that provides extensive links to tools and projects. It is worth reading and following the links.

Many of us enjoy detective fiction and watch crime scene television shows.  Are we ready to become thoughtful investigators of the digital materials that will be the evidence of this time in history?



We Can Haz Standards? Yes, We Can!

The following is a guest post by Jimi Jones, Digital Audiovisual Formats Specialist with the Office of Strategic Initiatives. I’m the co-chair of the National Digital Stewardship Alliance Standards Working Group along with Andrea Goethals of Harvard University. Over the past year, the working group has been engaged in a project to identify, describe and […]

The Artifactual Elements of Born-Digital Records, Part 1

The following is a guest post by Jefferson Bailey, Fellow at the Library of Congress’s Office of Strategic Initiatives. In Carl Fleischhauer’s recent four-part blog series, he discussed the challenges of, and different approaches to, capturing both the informational and the artifactual aspects of physical books and photographic negatives when reproducing these records in digital […]

A Museum Perspective on Digital Preservation

The following is a guest post by Megan Forbes, Manager of Collection Information and Access, Museum of the Moving Image. Several weeks ago, I had the pleasure of attending the Digital Library Federation’s 2011 Fall Forum, where I participated in a panel about data management, digital curation and digital preservation. I felt a bit like […]

Digital Preservation and the 1963 Kennedy Assassination Study

Events associated with the Kennedy assassination offer a compelling case study regarding obsolete data formats and digital preservation. Shortly after the assassination of President Kennedy on this day 48 years ago, an organization turned to the latest computer technology in an effort to study the tragedy.  From November 26 through December 3, 1963, the National […]

Unbreaking News You Can Use: The National Digital Newspaper Program

The following is a guest post by David Brunton, a Supervisory Information Technology Specialist in the Library of Congress Office of Strategic Initiatives. I have heard the National Digital Newspaper Program jokingly described as “putting breaking news online, within 200 years.” In some ways, it’s a fitting tag line: the most current newspaper pages released […]

Have You Got The Right Stuff? Or, Is Your Digital Content Sustainable?

The following is a guest post by Steve McCollum, Digital Media Project Coordinator, Office of Strategic Initiatives. Central to any digital preservation strategy is making sure that the stuff you have is the right stuff.  To that end, the Library of Congress endeavors to make sure that digital image files delivered by contractors in a […]

It’s Beginning to Look A Lot Like… Election Archiving Season!

The following is a guest post by Abbie Grotke, Web Archiving Team Lead. The United States national elections are a year away, but the Library of Congress is already busy archiving presidential campaign websites and preparing to archive House and Senate campaign sites and more starting in March 2012. This actually isn’t the earliest we’ve […]