IIPC @10

Late in April, in Ljubljana, Slovenia, the International Internet Preservation Consortium gathered for its annual General Assembly. This year is the 10th anniversary of the organization, and we marked the milestone by reflecting on our past accomplishments and thinking about how the members could work together to make positive and lasting impacts on the field of web archiving.

Members of the IIPC at the 2013 General Assembly

The mission of the IIPC is to “to acquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations.” The original vision included collections that were built with common tools and practices that would enable interoperable access across systems and boarders. This vision is still at the core of what the IIPC is trying to accomplish, and many strides have been made toward this goal.

At this year’s GA several discussions centered around how to keep the IIPC  making significant contributions to cultural heritage and remain relevant in a fast-changing web world. In addition to noting achievements the members suggested to the  Steering Committee how to guide the organization in the next 10 years, below are the major themes that came out of these discussions during the entire week of the General Assembly.

Support Common Tools & Standards

The most obvious achievements of the IIPC are open source tools for the harvesting, processing and navigating of web archives—Heritrix, Wayback and WARC Tools. These are the foundational tools used by most of the members and by a larger world-wide community for commercial and preservation purposes. Members absolutely depend on these tools and the web archiving practice has been built around them. These tools have always been open source but they are currently being moved to Github so a broader base of developers can contribute to, maintain and improve the code. The IIPC also developed the ISO preservation standard for web archives, WARC.

Looking toward the next 10 years, members recognize the current tools need to evolve or change to capture what will be the future of the web. The IIPC must continue to support new tools like the Memento Aggregator and the Live HTTP Proxy harvester. A diversity of tools to collect and preserve web sites will be advantageous to future uses because different tools will be able to collect different parts of the web. Some thinking about the need for evolving tools was presented at the  in-depth discussion David Rosenthal of Stanford University and Kris Carpenter of the Internet Archive hosted about the future of web capture.

Build the Collection

The collection built by IIPC members is truly massive and global. Over a petabyte of web sites have been captured and indexed. Entire domains from many countries have been preserved; the newest country to have the legal authority to preserve their whole domain is the United Kingdom. This content will be of great interest to current and future researchers.

IIPC members touring the National and University Library of Slovenia

During the GA an open conference was held titled “Scholarly Use of Web Archives: Progress Requirements and Challenges” where researchers who are using web archives in their work now shared with members their methods, interests and (sometimes) frustrations. No two researchers seem to want the same thing out of web archives.

Sophie Gebeil of Aix-Marseille-Université presented her research into how African immigration is discussed in the French web domain, and analyzed the contents of web sites much like other documentary evidence. While Megan Dougherty of the Loyola University of Chicago explained that she is most interested in specific features and design elements on web pages and how those change over time, not necessarily the intellectual content on the site. Ditte Laursen, a researcher in Denmark, explained she needed to capture second by second changes on social media and television web sites for her work.

IIPC members have web archive content that is of great interest to researchers but providing access to those collections are often a challenge. Two important researcher-lead initiatives to build access tools and establish methodologies in building research corpses were shared that hold great promise in bridging the gap between those who collect and preserve web archives and users.

Build the Community

The IIPC began in 2003 with 12 members. Tool development was the original focus. Standards and best practices for web archiving grew with the organization. The IIPC has grown to include 44 members, all willing to share best practices, develop tools and resources for the global cultural heritage community. It is the primary resource for organizations that are just starting web archiving programs, and it is a venue for organizations with mature web archiving programs that want to advance the field. The work has been truly collaborative as embers worked on projects and tools that met their local needs while contributing to the tools, standards and practices of the web archiving domain. The unique quality of the collaboration has been the focus on shared practice rather than organizational differences. The focus has resulted in a international resource of expertise in web archiving.

In recent years the IIPC established an Education and Training program to fund professional development workshops and sponsoring a PhD student at the University of North Texas Information School for special studies in web archiving. This effort coupled with the outreach and awareness projects the IIPC has taken on are important to building the community for web archiving and maturing the field. The IIPC as an organization is also maturing with the addition of its first full-time employee, Mary Pitt of the British Library, who will serve as both the Program and Communication officer.

***

The National and University Library of Slovenia were perfect hosts for the 2013 IIPC General Assembly. If you want to know more details about what was shared over the week see Ahmed AlSum’s detailed summary of the GA. Ro, Old Dominion University. Rosalie Lack of the California Digital Library also shared her impressions.

All presentations will be posted on the IIPC website.

Ian MacKaye and Citizen Archiving

I think of “citizen archivists” as the first responders of history, arriving early on the scene to gather, capture, describe and preserve ephemeral artifacts of interest and helping to ensure that they survive over time to share with the future. Thoughts on citizen archivists and their importance to institutions like ours were running through my …

Read more »

Picturing Perpetual Beta for Digital Preservation

The perfect digital preservation system does not exist. It may someday, but I don’t expect to live to see it. Instead, people and organizations are working on iterations of systems, and system components, that are gradually improving how we steward digital content over time. This concept of perpetual beta has been around for a while; …

Read more »

New Video: Digital Preservation at the Library of Congress’s Packard Campus for Audio Visual Conservation

We produce occasional short videos  related to digital preservation. These videos address such topics as personal digital archiving, adding descriptions to digital photographs and the K-12 Web Archiving program, to name a few. Our newest video profiles one of the Library of Congress’s most magnificent treasures: the Packard Campus for Audio Visual Conservation, located in …

Read more »

Pass it On: Preservation Week 2013 at the Library of Congress

We are excited to host a number of events in celebration of ALA’s Preservation Week (April 21-27, 2013) here at the Library of Congress next week.  We’ve collaborated with the Preservation Directorate and the Veteran’s History Project on programs to provide in-person guidance and help to those interested in saving their personal collections — digital …

Read more »

Digital Humanities and Digital Preservation

This past April 8 was the 2013 “Day of Digital Humanities.”  Started in 2010, this is an annual event of blogging and tweeting about the experience of digital humanities by graduate students, professors, alt-academics, librarians and other participants who identify with the field.  And “the field” of Digital Humanities can be whatever you define it …

Read more »

Why a National Digital Stewardship Alliance Student Group?

This is a guest post by Kim Schroeder, Wayne State University Lecturer and WSU NDSA Student Group Faculty Advisor. Sometimes a professional conference offers you nothing new. Sometimes it changes your direction. Sometimes and perhaps more critically, it confirms your route. I suppose I look at the NDSA as my professional Global Positioning System. Since …

Read more »

Nominations Now Open for the 2013 NDSA Innovation Awards

The National Digital Stewardship Alliance Innovation Working Group is proud to open the nominations for the 2013 NDSA Innovation Awards. As a diverse membership group with a shared commitment to digital preservation, the NDSA understands the importance of innovation and risk-taking in developing and supporting a broad range of successful digital preservation activities. These awards …

Read more »

From Books to Bits: Library of Congress Electronic Literature Showcase Highlights Emerging Literary Forms

This is a guest post by Susan Garfinkel, research specialist, Digital Reference Section at the Library of Congress. Electronic literature—past, future and present—is the focus of a free three-day program at the Library of Congress, April 3 to 5. The Electronic Literature Showcase, sponsored by the Library’s Digital Reference Section, includes a variety of events designed …

Read more »

First Aid on the Front Lines: Immediate Training Needs

The following is a guest post by Jody DeRidder, the Head of Digital Services at the University of Alabama Libraries. Have you been digitizing and managing digital content?  Are you the go-to person in your organization for accessioning digital materials into your special collections, or collecting electronic records for your archives? More than likely, you’ve …

Read more »