<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>The Signal: Digital Preservation</title>
	<atom:link href="http://blogs.loc.gov/digitalpreservation/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.loc.gov/digitalpreservation</link>
	<description>The Signal: Digital Preservation</description>
	<lastBuildDate>Wed, 22 May 2013 20:54:58 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.1</generator>
		<item>
		<title>Hey Content Creator: Make Mine Lossless!</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/hey-content-creator-make-mine-lossless/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/hey-content-creator-make-mine-lossless/#comments</comments>
		<pubDate>Wed, 22 May 2013 16:52:53 +0000</pubDate>
		<dc:creator>Butch Lazorchak</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[Personal Archiving]]></category>
		<category><![CDATA[audio]]></category>
		<category><![CDATA[compression]]></category>
		<category><![CDATA[images]]></category>
		<category><![CDATA[lossless]]></category>
		<category><![CDATA[lossy]]></category>
		<category><![CDATA[mp3]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14818</guid>
		<description><![CDATA[I’ve always loved the term “lossy” compression (add a “y” to anything and the “cute” factor really goes up). But just like a baby tiger is cute only so long as you understand that it will one day grow into a vicious, man-eating beast, lossy compression is cute only so long as you understand that [...]]]></description>
			<content:encoded><![CDATA[<p>I’ve always loved the term “lossy” compression (add a “y” to anything and the “cute” factor really goes up). But just like a baby tiger is cute only so long as you understand that it will one day grow into a vicious, man-eating beast, lossy compression is cute only so long as you understand that it may someday come back and bite you if you’re thinking about long-term preservation.</p>
<div id="attachment_14843" class="wp-caption alignleft" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/digital-compression.jpg"><img class="size-medium wp-image-14843" title="digital-compression" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/digital-compression-300x85.jpg" alt="Digital Compression by user spacepleb on Flickr" width="300" height="85" /></a><p class="wp-caption-text">Digital Compression by user spacepleb on Flickr</p></div>
<p>That sounds a bit hyperbolic so let me step back a bit. In 2011 <a href="http://blogs.loc.gov/digitalpreservation/2011/10/four-easy-tips-for-preserving-your-digital-photographs/">I wrote about IDOM</a>, four simple steps to helping you start thinking about how to preserve your own digital materials (for the record, it’s Identify, Decide, Organize and Make copies). One undeniable factor in “make copies” is that there’s a trade-off everyone has to make between quality and affordability.</p>
<p>We all want to store our digital data at the highest quality possible, but higher quality generally means larger file sizes, which means more storage which means more money. Compressed data, generally speaking, takes up less physical storage space and moves more easily over networks. The file size difference can be dramatic.</p>
<p>Let’s say you wanted to rip your CD collection and store it as high-quality <a href="http://www.digitalpreservation.gov/formats/fdd/fdd000001.shtml">WAVE</a> files on an external hard drive. A digital file that holds a typical three-minute song on a CD is 30–40 megabytes in size so an average CD <a href="http://www.ehow.com/way_5444845_many-megabytes-per-lp-record.html">would be around 450 megabytes</a>. If you had 1000 CDs in your collection you’d need about ½ a terabyte of storage. Things aren’t so bad these days, cost-wise: ½ terabyte would only run you about $40 (<a href="http://www.mkomo.com/cost-per-gigabyte">10 years ago it would have run you almost $1200.)</a></p>
<div id="attachment_14847" class="wp-caption alignleft" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/vinyl-kills.jpg"><img class="size-medium wp-image-14847" title="vinyl-kills" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/vinyl-kills-300x200.jpg" alt="vinyl kills the mp3 industry by user karola on Flickr" width="300" height="200" /></a><p class="wp-caption-text">vinyl kills the mp3 industry by user karola on Flickr</p></div>
<p>Now lets say you wanted to save storage space by compressing the audio. The <a href="http://www.digitalpreservation.gov/formats/fdd/fdd000012.shtml">MPEG Layer III Audio Encoding</a> (MP3 for short) <a href="http://www.techradar.com/us/news/audio/how-mp3-compression-works-916093">typically reduces the file size for an audio song by an order of magnitude.</a> So that half a terabyte would now be around 220 gigabytes and cost you roughly $20 total (prices for external hard drives fluctuate quite a bit so don’t hold me to these prices!).</p>
<p>However, when we’re thinking about preserving digital information we generally want to avoid compressing the data, unless we can compress it “losslessly.” “Lossless” compression means that we can shrink the size of any arbitrary piece of digital content, but we can also bring it back to its original size without losing any information in the transformation process.</p>
<p>&#8220;Lossy&#8221; compression, on the other hand, is a data encoding method that compresses data by removing part of it. Different compression schemes apply different algorithms to determine how to effectively discard the data while keeping the image within an acceptable level of quality as determined by the user’s needs, but there’s no getting around the fact that once the data is discarded under “lossy” compression schemes it’s gone for good.</p>
<p>While institutions (and individuals) want to save on costs as much as possible, we all want to retain as much of the utility of the information as we possibly can. We have no idea how much storage or <a href="http://www.technologyreview.com/news/429722/a-bandwidth-breakthrough/">bandwidth</a> will cost in the future (<a href="http://lj.libraryjournal.com/2011/09/opinion/our-worst-nightmare-the-end-of-unlimited-storage-and-bandwidth/">hopefully less</a>) nor do we know what future users might do with current data (<a href="http://www.spiegel.de/international/business/big-data-enables-companies-and-researchers-to-look-into-the-future-a-899964.html">undoubtedly many interesting things</a>), but we’re pretty sure we want to keep our options open.</p>
<p>An MP3 is an example of lossy compression. If you compress that original WAVE file utilizing the MP3 compression scheme the information you remove to decrease the file size is gone for good and you can’t bring it back. It is possible to convert your MP3 back to a WAVE file using <a href="http://www.labnol.org/software/how-to-convert-files/19523/">available software tools</a>, but all you’ll have is a mediocre WAVE file. The original information is gone and you can definitely <a href="http://www.stereophile.com/features/308mp3cd/">hear the difference</a>.</p>
<p>So if you want to preserve an audio file for the long-term you either need to keep it in its original format or utilize a compression scheme that allows you roll back your compressed file to its original form.</p>
<p>There are a number of <a href="http://en.wikipedia.org/wiki/Lossless_compression#Audio">lossless compressions schemes for audio</a>, though they’re not implemented equally by the major digital media players.</p>
<p><a href="http://blogs.loc.gov/digitalpreservation/files/2011/10/butch_dogg2.jpg"><img class="alignleft size-medium wp-image-3322" title="butch_dogg" src="http://blogs.loc.gov/digitalpreservation/files/2011/10/butch_dogg2-300x225.jpg" alt="" width="300" height="225" /></a>The same holds true for photographs. For example, let’s look at my “butch dogg” picture from the IDOM article.</p>
<p>This image is stored in <a href="http://www.jpeg.org/">Joint Photographic Experts Group (JPEG)</a> format which is a compressed format. Sadly, JPEGs are a form of <a href="http://www.jiscdigitalmedia.ac.uk/guide/file-formats-and-compression">“lossy” compression</a>.</p>
<p>Of course, a large amount of data can be discarded before the result is sufficiently degraded to be noticed by the user, but it’s the same situation as the audio described above. Had I been thinking long-term I might have made a different decision on the final-state format for my photo.</p>
<p>If planning these things out from the start, it’s most advantageous to start with a high-resolution master lossless file that can then be used to produce compressed files for different purposes; for example, a multi-megabyte file can be used at full size to produce a full-page advertisement in a glossy magazine while a smaller, lossy copy can be made for a small image on a web page.</p>
<p>A consideration of lossy vs. lossless compression is <a href="http://www.digitalpreservation.gov/formats/content/still_preferences.shtml">just one factor</a> in identifying sustainable stewardship practices, but it’s an important one to consider, especially at the start of a digital workflow. The Still Image Working Group of the Federal Agencies Digitization Guidelines Initiative <a href="http://blogs.loc.gov/digitalpreservation/2012/04/seeking-miraculous-and-lossless-victories-an-update-on-the-fadgi-still-image-working-group/">has been exploring</a> these issues in great depth.</p>
<p>Consensus is still developing on most sustainable preservation master formats (see recommendations from <a href="http://www.archives.gov/preservation/products/definitions/preservation-copy.html">NARA</a>, the <a href="http://dpbestflow.org/file-format/file-format-overview#which">American Society of Media Photographers</a> and <a href="http://blogs.loc.gov/digitalpreservation/2011/10/digital-preservation-file-formats-for-scanned-images/">others</a>) but compression is certainly one of the big issues to consider.</p>
<p>The stewardship community will undoubtedly spend plenty of time managing and preserving lossy files (<a href="http://socialcompare.com/en/comparison/image-file-formats">huge numbers</a> of JPEGs and MP3 files are already out there), but if you’ve got the option make yours lossless!</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/hey-content-creator-make-mine-lossless/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Explore Innovative Ideas Across the Digital Information Landscape at Digital Preservation 2013</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/explore-innovative-ideas-across-the-digital-information-landscape-at-digital-preservation-2013/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/explore-innovative-ideas-across-the-digital-information-landscape-at-digital-preservation-2013/#comments</comments>
		<pubDate>Tue, 21 May 2013 15:21:12 +0000</pubDate>
		<dc:creator>Erin Engle</dc:creator>
				<category><![CDATA[NDSA]]></category>
		<category><![CDATA[Outreach and Events]]></category>
		<category><![CDATA[Partners and Collaboration]]></category>
		<category><![CDATA[digital preservation 2013]]></category>
		<category><![CDATA[National Digital Stewardship Alliance]]></category>
		<category><![CDATA[ndsa]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[partners meeting]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14756</guid>
		<description><![CDATA[We like to think (and hope) that our blog The Signal acts as an informative resource from which to learn and engage in conversations of digital preservation work.  We hope that it exposes you to interesting projects and people stewarding digital collections, and that it creates opportunities to expose you to the wider community of [...]]]></description>
			<content:encoded><![CDATA[<p>We like to think (and hope) that our blog <em>The Signal</em> acts as an informative resource from which to learn and engage in conversations of digital preservation work.  We hope that it exposes you to interesting projects and people stewarding digital collections, and that it creates opportunities to expose you to the wider community of libraries, archives, museums, government agencies, and other organizations.</p>
<p>Just as the blog is a vehicle to explore challenges and solutions practitioners address managing digital information, we’re coming up on an excellent opportunity to share best practices and lessons learned from our individual and collective work – face-to-face.</p>
<div id="attachment_14786" class="wp-caption alignright" style="width: 296px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/dp12_dash.jpg"><img class=" wp-image-14786" title="Anil Dash speaking at the Digital Preservation 2012 meeting." src="http://blogs.loc.gov/digitalpreservation/files/2013/05/dp12_dash-300x189.jpg" alt="Anil Dash speaking at the Digital Preservation 2012 meeting." width="286" height="180" /></a><p class="wp-caption-text">Anil Dash speaking at the Digital Preservation 2012 meeting.</p></div>
<p style="text-align: left;">That opportunity is <a style="text-align: left;" href="http://www.digitalpreservation.gov/meetings/ndiipp13.html">Digital Preservation 2013</a>, the annual meeting of the <a href="http://www.digitalpreservation.gov/">National Digital Information Infrastructure and Preservation Program</a> and the <a href="http://www.digitalpreservation.gov/ndsa/">National Digital Stewardship Alliance</a>.  This year’s meeting will be held on July 23-25 in Alexandria, VA. <a href="http://www.digitalpreservation.gov/meetings/register/index.html" target="_blank">Registration</a> for the meeting is now open.</p>
<p>Once a year, we aim to put together a program that stimulates new thinking and exposes us to innovative ideas across the digital information landscape. It&#8217;s a chance to highlight collaborative projects on preserving and accessing digital collections, feature discussions of current tools and services for the management of digital content, and share current approaches to storage and infrastructure challenges.  For the ninth year now, we’ve been fortunate to gather members of the community and other interested professionals in an environment to share information about collecting, preserving, and delivering our cultural heritage, scientific and other valuable digital materials.</p>
<p>This year is no different and the planning committee has been busy crafting a solid, if not packed, agenda featuring speakers who are strong advocates for social and technological innovation. We’ll hear about topics such as managing “big data” at scale, the cultural and scholarly value of public and historical digital content, and the challenges of preserving digital resources and exploring how to provide access to them.</p>
<p>The meeting kicks off on the afternoon of July 23 with keynotes talks by <a href="http://www.hilarymason.com/" target="_blank">Hilary Mason</a>, chief scientist at <a href="https://bitly.com/" target="_blank">bit.ly</a>, and <a href="http://sarahwerner.net/blog/" target="_blank">Sarah Werner</a>, undergraduate program director at the <a href="http://www.folger.edu/" target="_blank">Folger Shakespeare Library</a>.  Later in the afternoon, we’ll be hearing from a panel of educators and practitioners including <a href="http://www.ils.unc.edu/callee/">Christopher (Cal) Lee</a> of <a href="http://sils.unc.edu/" target="_blank">University of North Carolina at Chapel Hill</a>; <a href="http://blogs.loc.gov/digitalpreservation/2012/02/jason-scott-rogue-archivist/">Jason Scott</a> of <a href="http://www.archiveteam.org/">the Archive Team</a>; <a href="http://blogs.loc.gov/digitalpreservation/2012/12/imagine-what-well-know-this-time-next-week-an-interview-with-bailey-smith-and-anne-wootton-of-pop-up-archive/">Anne Wootton</a> of <a href="http://www.popuparchive.org/" target="_blank">Pop-Up Archive</a>; and Travis May of the <a href="http://www.stlouisfed.org/" target="_blank">Federal Reserve Bank of St. Louis</a> on their own innovative approaches to preservation of various types of content.  We will round out the day with a series of lightning talks and a poster session from members of the National Digital Stewardship Alliance community.</p>
<p>On July 24, Lisa Green, executive director at <a href="http://commoncrawl.org/" target="_blank">Common Crawl</a>, and Emily Gore, director for content at <a href="http://dp.la/" target="_blank">Digital Public Library of America</a>, will lead off the day with opening talks. Following, there will be a unique dialogue on the emerging topic of digital preservation and environmental sustainability, the &#8220;Green Bytes: Sustainable Approaches to Digital Stewardship&#8221; panel with <a href="http://www.lockss.org/contact-us/dshr/" target="_blank">David Rosenthal</a> of  <a href="http://www.lockss.org/" target="_blank">Stanford University</a>; Kris Carpenter of the <a href="http://archive.org/index.php" target="_blank">Internet Archive</a>; and <a href="http://www.kkant.net/" target="_blank">Krishna Kant</a> of <a href="http://csis.gmu.edu/" target="_blank">George Mason University</a> and the <a href="http://www.nsf.gov/div/index.jsp?div=CNS" target="_blank">National Science Foundation</a>.  The final panel of the day features speakers including <a href="http://www.cooperhewitt.org/conversations/2013/04/03/meet-staff-aaron-straup-cope" target="_blank">Aaron Straup Cope</a> of <a href="http://labs.cooperhewitt.org/" target="_blank">Cooper-Hewitt Museum Labs</a>; <a href="http://civic.mit.edu/users/rodrigodavies" target="_blank">Rodrigo Davies</a> from the <a href="http://civic.mit.edu/" target="_blank">MIT Center for Civic Media</a>; and <a href="http://amyrobinson.me/about/" target="_blank">Amy Robinson</a> of the <a href="http://eyewire.org/" target="_blank">EyeWire</a> project talking about issues at the leading edge of digital stewardship activity.</p>
<p>Day two will also feature a variety of smaller breakout sessions that will enable conversation around topics of digital preservation education and training, demos of digital preservation tools and services, and presentations on digital curation topics.  And there will be special presentations by the NDSA Innovation Award winners, who will be named in the coming weeks.</p>
<div id="attachment_14787" class="wp-caption alignleft" style="width: 270px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/CurateCamp2012.jpg"><img class=" wp-image-14787 " title="CurateCamp Processing participants collaborating the day's agenda." src="http://blogs.loc.gov/digitalpreservation/files/2013/05/CurateCamp2012-300x225.jpg" alt="CurateCamp Processing participants collaborating the day's agenda." width="260" height="195" /></a><p class="wp-caption-text">CurateCamp Processing participants collaborating the day&#8217;s agenda.</p></div>
<p>On July 25, day three, we’re planning to co-host a <a href="http://curatecamp.org/about" target="_blank">CURATEcamp</a>, an <a href="http://en.wikipedia.org/wiki/Unconference" target="_blank">unconference</a>. Last year we co-hosted <a href="http://wiki.curatecamp.org/index.php/CURATEcamp_Processing_2012" target="_blank">CURATEcamp: Processing</a> that focused on the intersection between archival and computational notions of processing.  This year, we wanted to focus more on the digital stewardship community&#8217;s perspectives and discuss ideas about the exhibition of digital collections dealing with narratives, storytelling, and context.  Any curators, archivists, librarians, scholars, software developers, computer engineers and others looking to share, demonstrate and refine ideas about exhibition in the digital age are encouraged to attend and contribute to crafting the day&#8217;s discussions. More information about the camp will be shared on this blog in the next couple of weeks.</p>
<p>Digital Preservation 2013 will be held at the Westin Alexandria in Old Town Alexandria, VA.  There are two separate registration forms, one of the <a href="http://www.digitalpreservation.gov/meetings/register/register.php" target="_blank">main meeting</a> and one for <a href="https://docs.google.com/forms/d/1k_2ZrmN38-VTjKeMgsHgqRdHQRKcMLAffGuQ9s6zUkY/viewform" target="_blank">CURATEcamp: Exhibition</a>.  If you plan to attend both, please make sure to register using the separate forms.</p>
<p>There is no cost to attend either meeting but seats are limited and available on a first come, first served basis.</p>
<p>If you can’t make it, the first day will be taped and presentations will be posted on <a href="http://www.digitalpreservation.gov/meetings/ndiipp13.html">our website</a> a few weeks following the meeting.  You can also follow the event on Twitter at <a href="https://twitter.com/search?q=%23digpres13&amp;src=typd" target="_blank">#digpres13</a>.</p>
<p><em>Updated 5/22: fixed typos</em></p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/explore-innovative-ideas-across-the-digital-information-landscape-at-digital-preservation-2013/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IIPC @10</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/iipc-10/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/iipc-10/#comments</comments>
		<pubDate>Fri, 17 May 2013 20:07:32 +0000</pubDate>
		<dc:creator>Abbey Potter</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[Outreach and Events]]></category>
		<category><![CDATA[Partners and Collaboration]]></category>
		<category><![CDATA[Tools and Infrastructure]]></category>
		<category><![CDATA[IIPC]]></category>
		<category><![CDATA[International Internet Preservation Consortium]]></category>
		<category><![CDATA[web archives]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14712</guid>
		<description><![CDATA[Late in April, in Ljubljana, Slovenia, the International Internet Preservation Consortium gathered for its annual General Assembly. This year is the 10th anniversary of the organization, and we marked the milestone by reflecting on our past accomplishments and thinking about how the members could work together to make positive and lasting impacts on the field of [...]]]></description>
			<content:encoded><![CDATA[<p>Late in April, in Ljubljana, Slovenia, the International Internet Preservation Consortium gathered for its annual General Assembly. This year is the 10th anniversary of the organization, and we marked the milestone by reflecting on our past accomplishments and thinking about how the members could work together to make positive and lasting impacts on the field of web archiving.</p>
<div id="attachment_14737" class="wp-caption alignleft" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/Table.jpg"><img class="size-medium wp-image-14737" title="Table" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/Table-300x200.jpg" alt="" width="300" height="200" /></a><p class="wp-caption-text">Members of the IIPC at the 2013 General Assembly</p></div>
<p>The <a href="http://netpreserve.org/about-us/mission-goals">mission</a> of the IIPC is to &#8220;to acquire, preserve and make accessible knowledge and information from the Internet for future generations everywhere, promoting global exchange and international relations.&#8221; The original vision included collections that were built with common tools and practices that would enable interoperable access across systems and boarders. This vision is still at the core of what the IIPC is trying to accomplish, and many strides have been made toward this goal.</p>
<p>At this year&#8217;s GA several discussions centered around how to keep the IIPC  making significant contributions to cultural heritage and remain relevant in a fast-changing web world. In addition to noting achievements the members suggested to the  <a href="http://netpreserve.org/leadership">Steering Committee</a> how to guide the organization in the next 10 years, below are the major themes that came out of these discussions during the entire week of the General Assembly.</p>
<p><strong>Support Common Tools &amp; Standards</strong></p>
<p>The most obvious achievements of the IIPC are open source tools for the harvesting, processing and navigating of web archives—Heritrix, Wayback and WARC Tools. These are the foundational tools used by most of the members and by a larger <a href="https://webarchive.jira.com/wiki/display/Heritrix/Users+of+Heritrix">world-wide community</a> for commercial and preservation purposes. Members absolutely depend on these tools and the web archiving practice has been built around them. These tools have always been open source but they are currently being moved to <a href="https://github.com/iipc">Github</a> so a broader base of developers can contribute to, maintain and improve the code. The IIPC also developed the ISO preservation standard for web archives, <a href="http://www.iso.org/iso/catalogue_detail.htm?csnumber=44717">WARC</a>.</p>
<p>Looking toward the next 10 years, members recognize the current tools need to evolve or change to capture what will be the future of the web. The IIPC must continue to support new tools like the <a href="http://netpreserve.org/projects/memento">Memento Aggregator</a> and the <a href="http://netpreserve.org/projects/live-archiving-http-proxy">Live HTTP Proxy</a> harvester. A diversity of tools to collect and preserve web sites will be advantageous to future uses because different tools will be able to collect different parts of the web. Some thinking about the need for evolving tools was presented at the  <a href="http://blog.dshr.org/2013/04/talk-on-harvesting-future-web-at.html">in-depth discussion</a> David Rosenthal of Stanford University and Kris Carpenter of the Internet Archive hosted about the future of web capture.</p>
<p><strong>Build the Collection</strong></p>
<p>The collection built by IIPC members is truly massive and global. Over a petabyte of web sites have been captured and indexed. Entire domains from many countries have been preserved; the newest country to have the legal authority to preserve their whole domain is the United Kingdom. This content will be of great interest to current and future researchers.</p>
<div id="attachment_14738" class="wp-caption alignright" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/Tour.jpg"><img class="size-medium wp-image-14738" title="Tour" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/Tour-300x169.jpg" alt="" width="300" height="169" /></a><p class="wp-caption-text">IIPC members touring the National and University Library of Slovenia</p></div>
<p>During the GA an open conference was held titled &#8220;Scholarly Use of Web Archives: Progress Requirements and Challenges&#8221; where researchers who are using web archives in their work now shared with members their methods, interests and (sometimes) frustrations. No two researchers seem to want the same thing out of web archives.</p>
<p>Sophie Gebeil of Aix-Marseille-Université presented her research into how <a href="http://netpreserve.org/resources/memories-north-african-immigration-web">African immigration</a> is discussed in the French web domain, and analyzed the contents of web sites much like other documentary evidence. While Megan Dougherty of the Loyola University of Chicago explained that she is most interested in specific features and design elements on web pages and <a href="http://netpreserve.org/resources/researcher-engagement-web-archives-building-community-developing-tools-and-codifying">how those change over time</a>, not necessarily the intellectual content on the site. Ditte Laursen, a researcher in Denmark, explained she needed to capture second by <a href="http://netpreserve.org/resources/automated-screen-capturing-study-interplay-between-real-time-internet-and-live-television">second changes on social media and television web sites</a> for her work.</p>
<p>IIPC members have web archive content that is of great interest to researchers but providing access to those collections are often a challenge. Two important researcher-lead initiatives to <a href="http://netpreserve.org/resources/webart-facilitating-scholarly-use-web-archives">build access tools</a> and establish methodologies in <a href="http://netpreserve.org/resources/scholarly-use-web-archives-initiatives-and-fundamental-tools">building research corpses </a>were shared that hold great promise in bridging the gap between those who collect and preserve web archives and users.</p>
<p><strong>Build the Community</strong></p>
<p>The IIPC began in 2003 with 12 members. Tool development was the original focus. Standards and best practices for web archiving grew with the organization. The IIPC has grown to include 44 members, all willing to share best practices, develop tools and resources for the global cultural heritage community. It is the primary resource for organizations that are just starting web archiving programs, and it is a venue for organizations with mature web archiving programs that want to advance the field. The work has been truly collaborative as embers worked on projects and tools that met their local needs while contributing to the tools, standards and practices of the web archiving domain. The unique quality of the collaboration has been the focus on shared practice rather than organizational differences. The focus has resulted in a international resource of expertise in web archiving.</p>
<p>In recent years the IIPC established an Education and Training program to fund professional development <a href="http://netpreserve.org/projects/how-fit-integrate-web-archiving-program-your-organization">workshops</a> and <a href="http://netpreserve.org/projects/phd-sponsorship">sponsoring a PhD student</a> at the University of North Texas Information School for special studies in web archiving. This effort coupled with the <a href="http://netpreserve.org/web-archiving/videos">outreach and awareness</a> projects the IIPC has taken on are important to building the community for web archiving and maturing the field. The IIPC as an organization is also maturing with the addition of its first full-time employee, Mary Pitt of the British Library, who will serve as both the Program and Communication officer.</p>
<p>***</p>
<p>The National and University Library of Slovenia were perfect hosts for the 2013 IIPC General Assembly. If you want to know more details about what was shared over the week see <a href="http://ws-dl.blogspot.com/2013/05/2013-04-22-iipc-ga-2013.html">Ahmed AlSum&#8217;s</a> detailed summary of the GA. Ro, Old Dominion University. Rosalie Lack of the California Digital Library also shared her <a href="http://www.cdlib.org/cdlinfo/2013/05/16/five-things-i-learned-at-iipc/">impressions</a>.</p>
<p>All presentations will be posted on the IIPC <a href="http://netpreserve.org/tags/2013-ga">website</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/iipc-10/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>What Do Researchers Want From Institutions that Preserve Digital Content?</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/what-do-researchers-want-from-institutions-that-preserve-digital-content/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/what-do-researchers-want-from-institutions-that-preserve-digital-content/#comments</comments>
		<pubDate>Thu, 16 May 2013 20:27:38 +0000</pubDate>
		<dc:creator>Bill LeFurgy</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[research use]]></category>
		<category><![CDATA[web archives]]></category>
		<category><![CDATA[web archiving]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14710</guid>
		<description><![CDATA[A smart-alecky way to answer the question in the title above would be: &#8220;why everything, of course.&#8221;  But we don&#8217;t traffic in snark here, at least not intentionally. User expectations influence so much of what stewardship organizations do. We collect and preserve all content primarily to support use, but the issue is especially important in [...]]]></description>
			<content:encoded><![CDATA[<p>A smart-alecky way to answer the question in the title above would be: &#8220;why everything, of course.&#8221;  But we don&#8217;t traffic in snark here, at least not intentionally.</p>
<div id="attachment_14717" class="wp-caption alignright" style="width: 350px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/to.jpg"><img class="size-full wp-image-14717" title="to" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/to.jpg" alt="To provide support to Clemson researchers at the times and places they need it, by By clemsonunivlibrary, on Flickr" width="340" height="254" /></a><p class="wp-caption-text">To provide support to Clemson researchers at the times and places they need it, by <a href="http://www.flickr.com/photos/clemsonunivlibrary/6838379276/">clemsonunivlibrary,</a> on Flickr</p></div>
<p>User expectations influence so much of what stewardship organizations do. We collect and preserve all content primarily to support use, but the issue is especially important in a digital context.</p>
<p>Digital stewardship at scale can involve significant resources over a long period of time, and justifying that allocation centers around access. As the <a href="http://brtf.sdsc.edu/">Blue Ribbon Task Force on Sustainable Preservation and Access</a> noted as a <a href="http://brtf.sdsc.edu/biblio/BRTF_Final_Report.pdf">primary finding</a>,  &#8221;when making the case for preservation, make the case for use. Without well articulated demand for preserved information, there will be no future supply.&#8221;</p>
<p>The matter jumped to the forefront of my attention a few weeks ago during <a href="http://www.webarchiving.nl/news/catchmeeting">Archiving the Web: How to Support Research of Future Heritage?,</a> a meeting at the National Library of the Netherlands.  All the presentations dealt with the issue of connecting with researchers, but one by <a href="http://thepoliticsofsystems.net/about/">Bernhard Rieder</a> struck me as intriguingly proactive. Rieder is a researcher in the Media Studies department of the University of Amsterdam and is interested in using big collections of web archives to study social patterns and trends. He talked about the methodological challenges of using such collections and concluded with a  <a href="http://www.webarchiving.nl/news/theinternetarchivesitselfdoesntititscalledgoogle">&#8220;wish list&#8221;</a> of what he wanted from institutions that collect web archives. While I do not necessarily agree with these wishes, I did appreciate hearing his perspective. Here is a summary of the list:</p>
<ol>
<li>Moral, political and legal support for researchers and others building large collections of web data.</li>
<li>Fast search based access to a continuous 1% sample of institutional holdings with translated URLs and click data, as well as comprehensive statistics for the collection.</li>
<li>An easy, non-bureaucratic way to submit researcher data collections to institutions, without having to use standardized formats.</li>
</ol>
<p>Other researchers surely have their own wishes, and it does seem to me that libraries and archives could benefit from hearing them. There is an novel approach to do this underway at the University of Melbourne, where the IT Research Services Department <a href="http://dfflanders.wordpress.com/2013/04/15/resbaz/">actively engages with researchers,</a> including face-to-face meetings to learn about what they want. The university found that researchers appreciate the chance to meet, and as a result the department is taking an active approach to social engagement with a “Bazaar” of choices and approaches. Many of the approaches focus on helping researchers learn about tools and how to best apply them in working with data collections.</p>
<p>We also have explored the issue on this blog, with a series of posts by guest posts by Kalev H. Leetaru on <a href="http://blogs.loc.gov/digitalpreservation/2012/05/a-vision-of-the-role-and-future-of-web-archives-conclusions-and-the-role-of-archives/">A Vision of the Role and Future of Web Archives.</a> The International Internet Preservation Consortium also recently held a meeting on the subject of <a href="http://blogs.loc.gov/digitalpreservation/2013/02/call-for-proposals-the-scholarly-use-of-web-archives/">Scholarly Access to Web Archives: Progress, Requirements, and Challenges.</a></p>
<p>It would be great to learn about other efforts to plumb researcher interest&#8211;please let us know about them.</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/what-do-researchers-want-from-institutions-that-preserve-digital-content/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Reality Check: What Most People Actually Do with Their Personal Digital Archives</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/reality-check-what-most-people-actually-do-with-their-personal-digital-archives/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/reality-check-what-most-people-actually-do-with-their-personal-digital-archives/#comments</comments>
		<pubDate>Wed, 15 May 2013 13:00:48 +0000</pubDate>
		<dc:creator>Mike Ashenfelder</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[outreach]]></category>
		<category><![CDATA[personal archiving]]></category>
		<category><![CDATA[public libraries]]></category>
		<category><![CDATA[repositories]]></category>
		<category><![CDATA[social media]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14653</guid>
		<description><![CDATA[While Noah Lenstra was working on a website about African-American history in Champaign-Urbana, Illinois, many of the people he met at local public libraries, churches and businesses told him they had personal and family memorabilia they wanted to digitize, or they had digital stuff that they didn&#8217;t know what to do with. Lenstra, a PhD student [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_14660" class="wp-caption alignright" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/lenstra-PDA-2_300.jpg"><img class="size-full wp-image-14660" title="lenstra-PDA-2_300" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/lenstra-PDA-2_300.jpg" alt="" width="300" height="122" /></a><p class="wp-caption-text">Photo courtesy of Noah Lenstra</p></div>
<p>While Noah Lenstra was working on a <a href="http://eblackcu.net/" target="_blank">website about African-American history in Champaign-Urbana, Illinois</a>, many of the people he met at local public libraries, churches and businesses told him they had personal and family memorabilia they wanted to digitize, or they had digital stuff that they didn&#8217;t know what to do with. Lenstra, a PhD student at the University of Illinois Graduate School of Library &amp; Information Science, saw that there was a need in the community for personal digital archiving guidance. So with the help of a grant from the Illinois Humanities Council, he gave a series of much-appreciated public workshops in various cities and towns around the state on the topic of “Digital Local &amp; Family history.&#8221; The workshops yielded a few startling revelations for Lenstra.</p>
<p>Like any good educator, Lenstra not only wanted to share information and confirm the effectiveness of his work, he also wanted to observe the participants to see what he could learn from their actions. So he spent a lot of time working shoulder to shoulder with them, huddled in groups around computers, watching what they did online, asking questions and taking notes.</p>
<p>As he expected, genealogy and local history were popular (they have long been a draw for public libraries) and almost 70% of the attendees told him they were equally interested in both. But he also noticed two striking trends. First, librarians &#8212; particularly front-line staff – were hungry for guidance about personal digital archiving and some traveled great distances to take the workshop. And second, while people in the community accepted his didactic &#8220;eat your vegetables&#8221; personal digital archiving advice (the kind I dole out all the time), they had already organically developed a fun approach on their own that they used all the time: they uploaded their personal archives to Facebook and other social media sites to share it with their family, friends and community. Although a vocal minority of participants had many privacy concerns with the &#8220;social web&#8221; and did not use it, most had started using these tools both to share personal archives with others and to find content in others’ personal archives that documented facets of their own lives.</p>
<div id="attachment_14656" class="wp-caption alignleft" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/Lenstra-PDA-1_300.jpg"><img class="size-full wp-image-14656" title="Lenstra-PDA-1_300" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/Lenstra-PDA-1_300.jpg" alt="" width="300" height="233" /></a><p class="wp-caption-text">Photo courtesy of Noah Lenstra</p></div>
<p>Lenstra didn&#8217;t rely solely on showing PowerPoint slides in his workshops. &#8220;We tried to make the workshops as interactive as possible,&#8221; said Lenstra. &#8220;I introduced them to sites like 1000Memories (now part of ancestry.com) and omeka.net, and had them create accounts. We had hands-on scanning exercises. I also encouraged participants to use the computers to look up things I mentioned during the workshops.&#8221;</p>
<p>But Facebook kept popping up in conversations.</p>
<p>Eventually Lenstra realized that the widespread use of Facebook indicated a need and desire for an easy-to-use tool that would enable users to stay in touch with people in the present and connect with their past and with others who shared that past. &#8220;People were interested in collaborative platforms for sharing some of their personal archives with others,&#8221; said Lenstra. &#8220;They wanted to use their personal archives for purposes that went beyond them as individuals.&#8221;</p>
<p><em>Connection</em> is key to understanding the phenomenon. Lenstra said, “People want to make connections between things they have in their own personal archives and things that may be in other people&#8217;s personal archives, between one person’s history and other people’s histories.&#8221;</p>
<p>Lenstra gave an example of a man whose father played in a local band in the 1960s and 1970s. The son uploaded a photo of his father and his band to a site that had the theme, &#8220;You know you grew up in Urbana-Champaign if you remember&#8230;.&#8221; Lenstra said, &#8220;People who were related to the other members of the group started tagging their own family members. In no time, all the people in the photograph were identified and then other people started sharing <em>their</em> memories of people seeing the band perform. That kind of thing happens all the time, where a personal item quickly becomes something that&#8217;s not just about the family, it&#8217;s about the place where that family lived. In people&#8217;s memories, both the family and the place are bound together.&#8221;</p>
<p>Another fact that Lenstra teased out from the tangle of observations is that, for some people, Facebook has become a convenient digital repository. &#8220;I&#8217;ve heard people say that their computers crashed and they had a lot of photographs lost,&#8221; said Lenstra. &#8220;And they were able to recover many photographs through Facebook because they had put so many of their photos on it. I am not saying that Facebook is a solution, but I think pragmatically for some people it may be a more viable option than taking on the responsibility of backing up their files every five years or so. I remain dubious that people are actually going to do that.&#8221;</p>
<div id="attachment_14661" class="wp-caption alignright" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/lenstra-PDA-3_300.jpg"><img class="size-full wp-image-14661" title="lenstra-PDA-3_300" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/lenstra-PDA-3_300.jpg" alt="" width="300" height="204" /></a><p class="wp-caption-text">Photo courtesy of Noah Lenstra.</p></div>
<p>Setting the workshops in public libraries was a smart choice. Lenstra said that public libraries have a unique role in the community and libraries are increasingly playing a more active role in helping their communities with personal digital matters. About 25 percent of the workshop attendees were staff from public libraries from the cities where the workshops were held and from the surrounding towns.</p>
<p>Librarians are also getting more involved with local history than ever and, by extension, with individual history. Lenstra cites one example of a librarian from a small town in southern Illinois who developed a local history Facebook page. Occasionally she would post photos on Facebook and within a short time people from her social network would identify some of the photos and the people in them. Lenstra said, &#8220;Another time the librarian wanted to purchase a local historical artifact that she found on eBay, so she just posted a message on her page to the effect that the library wanted to purchase it and she was soliciting donations. Within a day, people had pledged enough money to cover the cost. She is enthusiastic about how she &#8212; a public librarian &#8212; can catalyze the attention of people in her community around sharing local and family histories.&#8221;</p>
<p>Lenstra gave an example of how a library in Champaign-Urbana recognized and responded to a digital-preservation need in their community. The library has two publicly accessible scanners intended for patrons to scan copies from the local archives. &#8220;But now, the scanner is used more by people who bring in personal things that they want to scan,&#8221; said Lenstra. &#8220;The library didn&#8217;t plan for this but the scanners are a resource that people want and need, so now the library is advertising the scanners as resources for people to digitize portions of family photographs or whatever. Clearly there is a real opportunity to reinvent this service area.&#8221;</p>
<p>Lenstra’s <a href="http://manual.eblackcu.net/wiki/Main_Page" target="_blank">next workshop is on June 22</a> and he sees it as an occasion to take what he’s learned so far and fine tune the workshop. He&#8217;s still mulling over questions like, &#8220;How can libraries make Facebook work for them?&#8221; and &#8220;Is there something libraries can do or create that fulfills the same role that Facebook now fills?&#8221; Lenstra said, &#8220;We&#8217;re still trying to figure out what is motivating people. What do they get that&#8217;s personally valuable out of some of these social media sites?&#8221;</p>
<p>He plans to meet eventually with library administrations to diagnose what is going on in their libraries and explore ways that libraries could better share local resources, serve their community&#8217;s digital preservation needs and support local and family history. Perhaps they could partner with other local institutions or maybe act as a community digital repository. Lenstra said, &#8220;One of the libraries we worked with had a partnership with the county government and the county government did all of their IT support. That kind of resource-sharing partnership makes new things possible without draining existing resources.&#8221;</p>
<p>But it&#8217;s clear that technology needs to catch up to the way people behave – which partially explains the popularity of social media – and information professionals need to acknowledge how people actually do things. Despite our best intentions and our wishes for more people to practice good personal digital archiving, the number of people who actually backup their files is small compared to the number of people who upload many of those same files to Facebook. This fact is backed up by Microsoft Research&#8217;s Cathy Marshall, an early leader in research on personal digital archiving. <a href="http://www.loc.gov/today/cyberlc/feature_wdesc.php?rec=5581" target="_blank">In a Library of Congress  presentation</a>, she said, &#8220;Nobody really does backup. They secretly hope that someone else is doing backup.&#8221;</p>
<p>Lenstra made a similar observation. He said, &#8220;I don&#8217;t think most people see personal digital archiving as something inherently useful. Many see personal archives as a means to some other kind of end. And so when we do talk about it to them, what kind of language can we use that wouldn&#8217;t be overwhelming or just be perceived as an onerous burden on their time?&#8221;</p>
<p>Lenstra is not suggesting that, despite the general public inaction about &#8220;what you should do,&#8221; we shouldn&#8217;t stop offering personal digital archiving help. People might not backup their files but they should still be aware of good preservation practices. And, in response to how people actually do things, we information professionals should re-think our approach.</p>
<p>Public libraries are the common meeting ground for all of us. As for how to improve the situation, Lenstra said, &#8220;I&#8217;d like to see more public librarians get the support they need. And I&#8217;d like to see more public libraries offer services that help people, so people feel like they don&#8217;t just have to do things on Facebook&#8230;help people understand that there are options or resources in their community to help them preserve their personal digital stuff. I may be idealistic but I&#8217;m hoping that, with support, more public libraries can and will help.&#8221;</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/reality-check-what-most-people-actually-do-with-their-personal-digital-archives/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>The State Electronic Records Initiative: Content Matters Interview</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/the-state-electronic-records-initiative-content-matters-interview/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/the-state-electronic-records-initiative-content-matters-interview/#comments</comments>
		<pubDate>Tue, 14 May 2013 17:28:15 +0000</pubDate>
		<dc:creator>Bill LeFurgy</dc:creator>
				<category><![CDATA[Content Matters Interview]]></category>
		<category><![CDATA[NDSA]]></category>
		<category><![CDATA[Partners and Collaboration]]></category>
		<category><![CDATA[state government information]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14671</guid>
		<description><![CDATA[This is a Guest Post by Abbie Grotke, the Library of Congress Web Archiving Team Lead and Co-Chair of the National Digital Stewardship Alliance Content Working Group. In this installment of the Content Matters interview series of the National Digital Stewardship Alliance Content Working Group, I interview Jim Corridan, President of the Council of State [...]]]></description>
			<content:encoded><![CDATA[<p><em>This is a Guest Post by <strong>Abbie Grotke</strong>, the Library of Congress Web Archiving Team Lead and Co-Chair of the National Digital Stewardship Alliance <a href="http://www.digitalpreservation.gov/ndsa/working_groups/content.html">Content Working Group</a>.</em></p>
<p>In this installment of the <a href="http://blogs.loc.gov/digitalpreservation/category/content-matters-interview/">Content Matters interview series</a> of the <a href="http://www.digitalpreservation.gov/ndsa/working_groups/content.html">National Digital Stewardship Alliance Content Working Group</a>, I interview Jim Corridan, President of the <a href="http://www.statearchivists.org/index.htm">Council of State Archivists</a> and Matt Veatch and Beth Shields, Co-Chairs of the <a href="http://www.statearchivists.org/seri/index.htm">State Electronic Records Initiative</a> Steering Committee about their work.</p>
<div id="attachment_14675" class="wp-caption alignright" style="width: 385px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/5737749250_1f655f1840.jpg"><img class="size-full wp-image-14675" title="5737749250_1f655f1840" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/5737749250_1f655f1840.jpg" alt="Fairfield County 2010 Census Race/Ethnicity Distribution, By uconnlibrariesmagic, on Flickr" width="375" height="477" /></a><p class="wp-caption-text">Fairfield County 2010 Census Race/Ethnicity Distribution, By uconnlibrariesmagic, on <a href="http://www.flickr.com/photos/uconnlibrariesmagic/5737749250/">Flickr</a></p></div>
<p><strong>Abbie:</strong> What is the Council of State Archivists?  What sorts of organizations are involved?</p>
<p><strong>Jim, Matt and Beth:</strong> The Council of State Archivists is the non-profit association comprising the 56 directors of the principal archival agencies in each state and territorial government in the United States. In a majority of states, the state archivists also have responsibility for records management services.</p>
<p><strong>Abbie: </strong>Could you tell us a bit about what you see as the primary value of state archives and government electronic records?</p>
<p><strong>Jim, Matt and Beth:</strong> While not well understood, state archives play a critical role in our democracy. They collect the records &#8212; including electronic records &#8212; that ensure government transparency and accountability; protect the legal, civil and property rights of citizens; promote historically-informed public policy decisions; and preserve essential documentation of the nation&#8217;s history.</p>
<p><strong>Abbie: </strong>How are the electronic records in state archives are being used and by what types of users?</p>
<p><strong>Jim, Matt and Beth:</strong> At this early stage in the effort to collect digital materials, users of state archives electronic records appear to be similar to users of paper records &#8212; attorneys performing legal research, journalists reviewing governmental decisions, scholars investigating community history, teachers preparing lesson plans, students incorporating primary sources into research projects, property owners identifying previous residents of their homes and genealogists seeking family history. As state archives enhance their capacity to collect and make electronic records available online, the number and variety of users will expand and use cases will diversify.</p>
<p><strong>Abbie: </strong>Tell us about the background of the State Electronic Records Initiative &#8212; how did it get started, and what are the goals?</p>
<p><strong>Jim, Matt and Beth:</strong> State archives have long worked to address electronic records effectively and comprehensively. The first state electronic records initiative began in 1979 with a grant to the State Historical Society of Wisconsin. However, state archives have struggled to keep pace with the exponential increase in the volume and complexity of electronic records, particularly in the face of precipitous funding cuts over the last decade. Thus, what was for many years a major concern has now become a crisis.</p>
<p>In July 2011, CoSA launched the State Electronic Records Initiative,  the first comprehensive national effort to improve electronic records preservation in state government, with initial funding provided through Library Services and Technology Act grants from Indiana and Kentucky.  In Phase 1, each state archives and records management program completed a survey about their existing electronic records programs and participated in extended follow-up telephone interviews. Using this baseline data, CoSA established goals for additional SERI activities in four broad areas:</p>
<ul>
<li><strong>Education and training</strong>, to provide state archives staff with opportunities to develop the knowledge and skills required to collect, manage and preserve electronic records.</li>
<li><strong>Awareness</strong>, to raise the level of support and knowledge of the electronic records issue among allied organizations and key stakeholders.</li>
<li><strong>Governance</strong>, to integrate the electronic records management and archives requirements in decisions made during IT planning, procurement, systems development and operations.</li>
<li><strong>Best practices and tools</strong>, to establish a resource center for electronic records standards, tools and policies and to develop pathways to success for strengthening archives and records management programs.</li>
</ul>
<p>The SERI Steering Committee established subcommittees for each of the four areas of emphasis. Forty individuals representing twenty-three state and territorial archives contribute to the work of these very active subcommittees.</p>
<p>CoSA also received federal grant support for SERI activities in two of the four areas of emphasis. An IMLS Laura Bush 21st Century Librarian grant will fund continuing education scholarships and three immersive, week-long electronic records institutes for staff from all 56 state and territorial archives; and an NHPRC grant will support the development of an interactive web portal for electronic records training, tools and standards, designed specifically to the address the needs of government records archivists.</p>
<div id="attachment_14672" class="wp-caption alignright" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/drives.jpg"><img class="size-medium wp-image-14672" title="drives" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/drives-300x225.jpg" alt="Delivering data, by k0a1a.net, on Flickr" width="300" height="225" /></a><p class="wp-caption-text">Delivering data, by k0a1a.net, on <a href="http://www.flickr.com/photos/binary_koala/4100679811">Flickr</a></p></div>
<p><strong>Abbie: </strong>What are the biggest digital preservation challenges faced by state archives and records management programs? Have any of those changed since the original survey took place?</p>
<p><strong>Jim, Matt and Beth:</strong> State archives and records management programs face many of the same digital preservation challenges that other digital stewards are grappling with: identifying and implementing appropriate digital preservation tools and services; selecting the electronic records that warrant long-term preservation; automating ingest workflows; securing technical infrastructure; capturing adequate preservation metadata; adopting sound preservation planning practices; and providing consumer-friendly access to electronic records. A particularly challenging issue for most state archives is developing effective strategies for ensuring that records management and archival requirements receive appropriate consideration during government information system planning, procurement, design and implementation. State archives operate in an environment in which they are responsible for preserving electronic records of enduring value generated by dozens of different government agencies involved in an array of disparate lines of business. The electronic records produced within this government ecosystem are increasingly complex, interrelated and voluminous. If automated records management and archival functions are not embedded in government information systems, state archives will struggle to effectively preserve the American historical record. As noted earlier, state archives are attempting to meet the digital preservation challenge in an era of severe resource scarcity. SERI is designed to provide state archives with tools that will assist them with all of these challenges.</p>
<p><strong>Abbie: </strong>Is there anything that surprised you about the survey results or self-assessments that have been performed as a part of SERI?</p>
<p><strong>Jim, Matt and Beth:</strong> Considering that the electronic records issue has been on the radar screen for many years, it was somewhat surprising to see the relatively limited number of state archives that have implemented extensive electronic records programs. That being said, we should note that nearly all state and territorial archives recognize the importance of developing their electronic records programs and, in the months since the SERI Phase 1 assessment, many states have hired electronic records staff and/or launched new or expanded electronic records initiatives. We hope that upcoming SERI activities &#8212; particularly the electronic records institutes &#8212; will spur additional states and territories to engage digital preservation more aggressively.</p>
<p><strong>Abbie: </strong>Your strategic training and education program sounds incredibly valuable and timely. What sorts of topics will be covered in the training? How many will be trained as a part of this program?</p>
<p><strong>Jim, Matt and Beth:</strong> CoSA is enthusiastic about the potential impact of the IMLS-funded SERI education and training program.  The electronic records institutes kick off this summer with a one-week introductory session for state archives that are in the initial stages of establishing a digital preservation program. The curriculum for this intensive training camp will take a lifecycle approach to electronic records management and digital preservation and covers topics ranging from policy development, advocacy, collaboration and financial sustainability to metadata standards, workflows and trustworthy digital repositories.  In 2014, introductory institute attendees will have the opportunity to participate in one of two advanced electronic records institutes that will explore electronic records and digital preservation in more depth. State archives with more developed electronic records programs also will participate in one of the advanced institutes.  A series of webinars on specific digital preservation topics will round out the training opportunities afforded to electronic records institute participants.</p>
<p>In addition to the institutes, the SERI IMLS grant also provides all 56 states and territories with up to $1000 in grant funds to use for staff training on electronic records management and preservation. State archives can use these continuing education funds for off-site training, on-line training, collaborative regional training, or to bring training on-site.</p>
<p><strong>Abbie: </strong>I noticed on your website that SERI has produced a video. Are there others planned?</p>
<p><strong>Jim, Matt and Beth:</strong> While SERI has posted <a href="http://www.youtube.com/user/StateArchivists">one YouTube video</a> – Mary Beth Herkert, Oregon State Archivist, describing her state’s cloud-based records management system &#8212; we don&#8217;t have specific plans for additional videos. However, the electronic records institutes and work on the interactive web portal may generate opportunities to share our ideas through videos. CoSA is also developing a series of webinars on electronic records, and those too may soon be publicly available.</p>
<p><strong>Abbie: </strong>What is the current focus of work for SERI? Where do you go from here?</p>
<div>
<p><strong>Jim, Matt and Beth:</strong> The IMLS and NHPRC grant projects will continue to be SERI&#8217;s focus for the next 12 to 18 months. Beyond that, we are evaluating strategies for engaging CoSA&#8217;s stakeholders more effectively to enhance digital preservation awareness and expertise within the broader government information management community to develop a national understanding of the challenges and risks faced by electronic records and the potential losses of significant portions of the American story.</p>
<p><del>                                                         </del></p>
<p>What kind of content matters to you? This is but one case for preserving valuable content for long term access.  If you or your institution would like to share your own story of use and long term value of access to a particular type of born-digital resources, please send us a note at <a href="mailto:ndsa@loc.gov">ndsa@loc.gov</a> and in the subject line mark it to the attention of the Content Working Group. We would love to hear from you!</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/the-state-electronic-records-initiative-content-matters-interview/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Hydra&#8217;s Open Source Approach: An Interview with Tom Cramer</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/hydras-open-source-approach-an-interview-with-tom-cramer/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/hydras-open-source-approach-an-interview-with-tom-cramer/#comments</comments>
		<pubDate>Mon, 13 May 2013 16:16:45 +0000</pubDate>
		<dc:creator>Trevor Owens</dc:creator>
				<category><![CDATA[Insights Interview]]></category>
		<category><![CDATA[NDSA]]></category>
		<category><![CDATA[Tools and Infrastructure]]></category>
		<category><![CDATA[open source]]></category>
		<category><![CDATA[open source software]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14344</guid>
		<description><![CDATA[The following interview is a guest post from Karen Cariani, Director of the WGBH Media Library and Archives at WGBH Educational Foundation and Co-Chair for the National Digital Stewardship Alliance Infrastructure Working Group.  Open source software is playing an important role in digital stewardship. In an effort to better understand the role open source software is [...]]]></description>
			<content:encoded><![CDATA[<div id="attachment_14345" class="wp-caption alignright" style="width: 223px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/04/tom-hydra.png"><img class=" wp-image-14345 " title="Tom Cramer" src="http://blogs.loc.gov/digitalpreservation/files/2013/04/tom-hydra-290x300.png" alt="" width="213" height="220" /></a><p class="wp-caption-text">Tom Cramer , Chief Technology Strategist &amp; Associate Director, Digital Library Systems &amp; Services at Stanford University Libraries</p></div>
<p><em>The following interview is a guest post from <strong>Karen Cariani</strong>, Director of the WGBH Media Library and Archives at WGBH Educational Foundation and Co-Chair for the National Digital Stewardship Alliance Infrastructure Working Group. </em></p>
<p>Open source software is playing an important role in digital stewardship. In an effort to better understand the role open source software is playing, the <a href="http://www.digitalpreservation.gov/ndsa/working_groups/infrastructure.html">NDSA infrastructure working group</a> is reaching out to folks working on a range of open source projects. Our goal is to develop a better understanding of their work and how they are thinking about the role of open source software in digital preservation in general.</p>
<p>For background on discussions so far, review our interviews with Bram van der Werf on <a href="http://blogs.loc.gov/digitalpreservation/2012/04/open-source-software-and-digital-preservation-an-interview-with-bram-van-der-werf-of-the-open-planets-foundation/">Open Source Software and Digital Preservation</a>, Peter Van Garderen &amp;  Courtney Mumma on <a href="http://blogs.loc.gov/digitalpreservation/2012/10/archivematica-and-the-open-source-mindset-for-digital-preservation-systems/">Archivematica and the Open Source Mindset for Digital Preservation Systems</a> and Mark Leggott on <a href="http://blogs.loc.gov/digitalpreservation/2013/03/islandoras-open-source-ecosystem-and-digital-preservation-an-interview-with-mark-leggott/">Islandora’s Open Source Ecosystem and Digital Preservation</a>. In this interview, we talk with Tom Cramer, Chief Technology Strategist &amp; Associate Director, Digital Library Systems &amp; Services at Stanford University Libraries.</p>
<p><strong>Karen:</strong>  Could you give us some background on the Hydra project? How did this project come about and what are its goals and objectives?</p>
<p><strong>Tom:</strong> Hydra&#8217;s goals are to combine the power of a repository for enterprise-scale digital asset management and preservation, with tailored interfaces, workflows and access systems specific to different content types and streams&#8211;e.g., articles vs. images vs. time-based media vs. books vs. data. The project started in 2009 when three universities (Hull, Stanford, Virginia) plus Fedora Commons came together to see if they could jointly develop a flexible application framework to complement Fedora. The project motto quickly became &#8220;if you want to go fast, go alone; if you want to go far, go together&#8221;, and we&#8217;ve spent as much energy on building a vibrant and sustainable community as we have on the code.</p>
<div id="attachment_14346" class="wp-caption alignnone" style="width: 595px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/04/hydra-model.png"><img class="size-full wp-image-14346 " title="Major Components of Hydra" src="http://blogs.loc.gov/digitalpreservation/files/2013/04/hydra-model.png" alt="" width="585" height="320" /></a><p class="wp-caption-text">Major Components of Hydra</p></div>
<p><strong>Karen:</strong> Could you tell us a bit about how you and your institution got involved in this project? Further, could you tell us a bit about how your thinking on digital repository platforms has changed and developed over time?</p>
<p><strong>Tom:</strong> In 2008-09, Stanford was re-evaluating the architecture and platform for the Stanford Digital Repository. When we started building the first generation of the system in 2005, we decided to write our own repository from scratch, as we we didn&#8217;t think the existing platforms at the time were good starting points for us. By 2008, we had a much better sense of our needs&#8211;which included collaborative development on a shared platform. We also felt that the repository community, and Fedora in particular, had greatly matured, and would make a serviceable component in our environment for the second-generation SDR. In discussions with colleagues at UVa and Hull, it became clear that we were not the only ones with the same needs, and a joint approach could be highly leveraged.</p>
<p><strong>Karen:</strong> The NDSA infrastructure working group’s exploration of open software is focused on figuring out if there are any inherent benefits to using open source software for parts of an organization’s digital preservation strategy. Do you see any such inherent benefits, and if so what are they?</p>
<p><strong>Tom:</strong>  Yes, I believe there is a substantial benefit around using open source software for digital preservation activities. Using open source software isn&#8217;t necessarily any cheaper than licensing commercial software, but it does give an institution a predictable ongoing cost for maintenance, and more control over direction than with most vendor products. So much of digital preservation is about the sustainability of supporting a system, these are important factors. The transparency that comes with OSS is also a benefit&#8211;with more eyes and more users on a given piece of software, the chances of uncovering latent issues is arguably greater than with proprietary software (though this would only apply in OSS projects with sufficient adopters, of course). Most importantly, though, I think using OSS puts the reins of digital preservation in your institution&#8217;s hands in a way that commercial software does not. In another ten years, when digital preservation is still better understood and there are a number of well-understood, commodity functions, I&#8217;d hope there is a robust and competitive marketplace of commercial solutions. Until then, though, the flexibility and customizability of OSS can provide a more direct path to meeting your preservation needs if they happen to fall outside the current market providers&#8217; product lines.</p>
<p><strong>Karen:  </strong>I am impressed by the commitment of the community to share and work collaboratively to improve.  What do you attribute this to? And how does this improve the project or make it a better model?</p>
<p><strong>Tom:</strong> The pattern of collaboration is baked into Hydra&#8217;s DNA; from the very beginning, the software has been a shared development effort. I think this may be due to the fact that we didn&#8217;t start the code-base by open sourcing a single institution&#8217;s system, nor did we have all the heavy-lifting initially done by grant-funded programmers. We put a lot of time into ongoing communication, including weekly calls, constant IRC, and quarterly face-to-face meetings. That said, I think the biggest draw to aggressive collaboration is the quality of the technical work on Hydra. Developers like working with good developers, and by participating in Hydra collaboratively and sharing their work for re-use, many institutions feel like they&#8217;ve produced their best code.</p>
<div id="attachment_14349" class="wp-caption alignnone" style="width: 595px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/04/4-23-2013-9-43-33-AM.png"><img class=" wp-image-14349 " title="Hydra's Community Model" src="http://blogs.loc.gov/digitalpreservation/files/2013/04/4-23-2013-9-43-33-AM.png" alt="" width="585" /></a><p class="wp-caption-text">Hydra&#8217;s Community Model</p></div>
<p><strong>Karen:</strong> How does the open community work?  What are the guidelines or rules for participating?</p>
<p><strong>Tom:</strong> We go to great lengths to make the community open and supportive. We value working code and welcome constructive engagement on any front: participants can add code or documentation, help in communications or outreach, or simply try to use the software, and ask question when they need help. We also put a lot of energy into training, to bring new community members up to speed. The Hydra Partners (of which there are now 17) are institutions that have each committed to the success of the project overall, not just their local use. We have a lot of coordination, a lot of consensus building, and very little central planning. In short, there aren&#8217;t &#8220;rules&#8221; so much as a community process, and people get out what they put in.</p>
<p><strong>Karen: </strong>What is the best way to keep up on Hydra head development?</p>
<p><strong>Tom:</strong> Joining one of the community email lists, <a href="mailto:hydra-tech@googlegroups.com">hydra-tech@googlegroups.com</a> or <a href="mailto:hydra-announces@googlegroups.com">hydra-announces@googlegroups.com</a>, is the best way to stay abreast of what&#8217;s up. We also do try to keep the website (<a href="http://projecthydra.org">http://projecthydra.org</a>) up to date, but in a project that is as large and fast moving as Hydra is, there is always a little latency.</p>
<p><strong>Karen: </strong>How do you think these projects can become sustainable? And how perhaps does being open help that sustainability vs a licensable vendor supported system?</p>
<p><strong>Tom:</strong> I think any software project&#8211;open source or commercial&#8211;is sustainable when it provides more benefit than cost, and it&#8217;s clear to participants that they get more out of participating/using it than going another route. The great thing about vendors is they provide that focusing lens to translate a community&#8217;s interest (and revenue) into ongoing development and support that benefits the community. Hydra has succeeded so far for the same reasons; it has focused the community&#8217;s efforts around a common approach, and created a framework to enhance and expand it.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/hydras-open-source-approach-an-interview-with-tom-cramer/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Open Data and Preservation</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/open-data-preservation/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/open-data-preservation/#comments</comments>
		<pubDate>Fri, 10 May 2013 16:58:06 +0000</pubDate>
		<dc:creator>Leslie Johnston</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[Tools and Infrastructure]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14636</guid>
		<description><![CDATA[Yesterday, May 9, 2013, the U.S. government issued an executive order and an open data policy mandating that federal agencies collect and publish new datasets in open, machine-readable, and, whenever possible, non-proprietary formats.  The new policy gives agencies six months to create an inventory of all the government-produced datasets they collect and maintain; a list [...]]]></description>
			<content:encoded><![CDATA[<p>Yesterday, May 9, 2013, the U.S. government issued an <a href="http://cdn.govexec.com/media/gbc/docs/pdfs_edit/050913jm1.pdf">executive order</a> and an <a href="http://www.whitehouse.gov/sites/default/files/omb/memoranda/2013/m-13-13.pdf">open data policy</a> mandating that federal agencies collect and publish new datasets in open, machine-readable, and, whenever possible, non-proprietary formats.  The new policy gives agencies six months to create an inventory of all the government-produced datasets they collect and maintain; a list of datasets that are publicly accessible; and an online system to collect feedback from the public as to how they would like to use the data.  The goals are twofold &#8212; greater access to government data for the public, and the availability of data in forms that businesses and researchers can better use.  This builds on the earlier <a href="http://www.whitehouse.gov/the_press_office/TransparencyandOpenGovernment">White House Memorandum on Transparency and Open Government</a>.</p>
<div id="attachment_14642" class="wp-caption alignright" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/data.jpg"><img class="size-medium wp-image-14642" title="Data" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/data-300x289.jpg" alt="Data" width="300" height="289" /></a><p class="wp-caption-text">Data</p></div>
<p>These documents were accompanied by a link to something that actually caught my fancy even more &#8211; a greatly expanded <a href="http://project-open-data.github.io/">Project Open Data Github repository</a> for guidelines, use cases and tools.  This, alongside the ever-growing (and soon to be extensively updated) <a href="http://www.data.gov/">data.gov</a>, are evidence of real efforts to release more data and make it truly useful and usable.</p>
<p>The documents provide guidance on open licensing, metadata, and standards, as well as lifecycle-based information stewardship. But what I personally keep struggling with are two questions: What IS open data? And how is is being preserved?</p>
<p>The project has some <a href="http://project-open-data.github.io/principles/">defining principles for open data</a> that I think can inform any dataset preservation project.  While reading through some of the documents, I came across this bullet point:</p>
<ul>
<li><em>Managed Post-Release.</em> A point of contact must be designated to assist with data use and to respond to complaints about adherence to these open data requirements.</li>
</ul>
<p>I am thrilled to see guidance about active management of datasets and supporting users in their work with the data.  But what could be available for this and all open dataset projects is more attention on dataset preservation.  These are a few of some great resources on this topic:</p>
<ul>
<li><a href="http://www.digitalpreservation.gov/formats/content/dataset_quality.shtml">The Library of Congress Sustainability of Digital Formats site on datasets</a></li>
<li><a href="http://archives.govt.nz/resources/information-management-research/preservation-public-sector-datasets">A Report on the Preservation of Public Sector Datasets from Archives New Zealand</a></li>
<li><a href="http://blog.nationalarchives.gov.uk/blog/open-data-and-archiving-datasets/">Open Data and Archiving Datasets from the National Archives UK</a></li>
<li><a href="http://www.dataone.org/best-practices/preserve">DataONE</a></li>
<li><a href="http://www.data-pass.org/">Data-PASS</a></li>
<li><a href="http://dataconservancy.org/">DataConservancy</a></li>
<li><a href="http://www.icpsr.umich.edu/icpsrweb/content/datamanagement/life-of-dataset.html">Life of a Dataset from ICPSR</a></li>
<li><a href="http://www.geomapp.net/docs/GIS_Archival_Processing_Process_v1.0_final_20111102.pdf">Best Practices for Archival Processing for Geospatial Datasets from the GeoMAPP Project</a></li>
<li><a href="http://wiki.opf-labs.org/display/REQ/Datasets,+Issues,+Contexts+and+Solutions">Datasets, Issues, Contexts and Solutions from the Open Planets Foundation</a></li>
</ul>
<p>Do public sector datasets present different issues for preservation from other datasets?  Not really.  They definitely have a potentially much higher level of public scrutiny and use.  But they have the same level of investment of time and money in their creation, serve the research and public good, and present the same format preservation issues as other research data.</p>
<p>&nbsp;</p>
<p>&nbsp;</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/open-data-preservation/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Fifty Digital Preservation Activities You Can Do</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/fifty-digital-preservation-activities-you-can-do/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/fifty-digital-preservation-activities-you-can-do/#comments</comments>
		<pubDate>Thu, 09 May 2013 21:28:12 +0000</pubDate>
		<dc:creator>Susan Manus</dc:creator>
				<category><![CDATA[Digital Content]]></category>
		<category><![CDATA[Personal Archiving]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[interns]]></category>
		<category><![CDATA[personal archiving]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14582</guid>
		<description><![CDATA[The following is a guest post by Tess Webre, former intern with NDIIPP at the Library of Congress Preservation Week 2013 might be over, but digital preservation must go on every week of the year. In truth, preservation is an ongoing, long lasting process that requires active management. Don&#8217;t despair, though. I have some helpful suggestions to [...]]]></description>
			<content:encoded><![CDATA[<p><em>The following is a guest post by <strong>Tess Webre</strong>, former intern with NDIIPP at the Library of Congress</em></p>
<p>Preservation Week 2013 might be over, but digital preservation must go on every week of the year. In truth, preservation is an ongoing, long lasting process that requires active management. Don&#8217;t despair, though. I have some helpful suggestions to help keep you in the preservation-y mood until next year.</p>
<div id="attachment_14630" class="wp-caption alignright" style="width: 262px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/50x.jpg"><img class="size-medium wp-image-14630" title="50x" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/50x-252x300.jpg" alt="Polk Motel's other sign, by SeeMidTN.com (aka Brent), on Flickr" width="252" height="300" /></a><p class="wp-caption-text">Polk Motel&#8217;s other sign, by SeeMidTN.com (aka Brent), on <a href="http://www.flickr.com/photos/brent_nashville/2273223084/">Flickr</a></p></div>
<ol>
<li>Find an online stewardship project that appeals to you and give it some time.  Try going <a href="http://www.crowdsourcing.org/">here</a> for insight into new projects. At a recent <a href="http://www.meetup.com/Digital-Cultural-Heritage-DC/">Digital Cultural Heritage DC</a> meeting, members of a prestigious museum said that crowdsourcing in the past few years had accomplished as much as 14 full time positions, so know that you are making a difference.</li>
<li>Locate a digital repository close to you; find out what activities/projects they are undertaking and use social media to promote work that you like.</li>
<li>Follow NDIIPP on <a href="https://twitter.com/ndiipp">Twitter</a></li>
<li>Like NDIIPP on <a href="https://www.facebook.com/digitalpreservation">Facebook</a></li>
<li><a href="http://www.loc.gov/rss/#preservation">Subscribe</a> to the monthly Library of Congress Digital Preservation Newsletter via email.</li>
<li>Volunteer at a local repository.  Call one up and see if they need help or supplies. They are usually quite nice, in my experience.</li>
<li>Write some blog posts to increase public awareness for libraries and repositories.</li>
<li>Do some personal digital estate planning. (<a href="logs.loc.gov/digitalpreservation/2012/01/the-big-digital-sleep/">Here are some insights</a>.)</li>
<li>Take a stab at some digital disaster planning. (<a href="http://blogs.loc.gov/digitalpreservation/2013/02/after-the-flood-digital-art-recovery-in-the-wake-of-hurricane-sandy/">Here are some more insights</a>.)</li>
<li>Visit the <a href="http://www.eyebeam.org/">Eyebeam Gallery</a> and marvel at how they were able to salvage so many of their digital assets that had been damaged in the flooding of Hurricane Sandy.</li>
<li>Take some time to do inventory on your storage media. Does it need an update? Here is one of the best <a href="http://www.digitalpreservation.gov/personalarchiving/documents/media_durability.pdf">guides on the topic.</a></li>
<li>Do some double checks on your cloud storage. Everything accounted?</li>
<li>Migrate proprietary files to an open sourced format.</li>
<li>Organize your email. (Really, it&#8217;s worth considering.)</li>
<li>Do some backups of your smart phone, tablet, digital camera and other mobile devices.</li>
<li>Make sure your digital photographs are accurately described.</li>
<li>Talk to your children about the need for good digital preservation skills.</li>
<li>Read your kids a digital preservation fairy tale (<a href="http://blogs.loc.gov/digitalpreservation/2013/03/snow-byte-and-the-seven-formats-a-digital-preservation-fairy-tale/">This one is also good for adults</a>.)</li>
<li>Write out your worst personal digital data loss story. Reflect on it.</li>
<li>Take a look at some of the Library of Congress’ <a href="http://www.youtube.com/playlist?list=PLEA69BE43AA9F7E68&amp;feature=plcp">many excellent videos</a> that discuss the challenges and solutions facing digital archivists. I personally love this one of <a href="http://www.youtube.com/watch?v=nZFg-uq5zBA&amp;list=ECEA69BE43AA9F7E68&amp;index=17">Cory Doctorow.</a></li>
<li>Watch a film that deals with preservation (Keanu Reeves just came out with a documentary  discussing <a href="http://sidebysidethemovie.com/">digital movies</a>.)</li>
<li>Read NDIIPP’s new collection of <a href="http://www.digitalpreservation.gov/documents/ebookpdf_march18.pdf">digital preservation perspectives</a>, it has many ideas and tips.</li>
<li>Take some time to brush up on your Spanish by reading this <a href="http://www.library.cornell.edu/preservation/tutorial-spanish/contents.html">tutorial on digitization</a> of images from Cornell University Library.</li>
<li>Or, if you don’t speak Spanish, brush up on your French, by reading this <a href="http://www.dpworkshop.org/dpm-french/index.html">digital preservation</a> management planning guide from University of Michigan.</li>
<li>It’s also available in <a href="http://www.dpworkshop.org/dpm-ita/index.html">Italian</a>.</li>
<li>Or take the time to learn a programming language. It’s extremely satisfying and not that difficult to learn.  Just start out with a “hello world” and you’re golden.</li>
<li>If that doesn’t appeal to you, take some time to learn a metadata schema such as Dublin Core, METS or others.</li>
<li>Pick a <a href="http://blogs.loc.gov/digitalpreservation/2013/04/mascots-for-team-digipres/">digital preservation mascot </a>to keep you motivated.</li>
<li>Take a look at the new <a href="http://dp.la/">Digital Public Library of America</a>. It’s quite awesome, and has some fantastic exhibits.</li>
<li>Check out the preservation of video games by <a href="http://pvw.illinois.edu/pvw/">going here.</a></li>
<li>Or take some time to look at the preservation of geospatial data, by <a href="http://www.geomapp.net/">going here.</a></li>
<li>Take a peek at how the internet used to look by visiting the <a href="http://archive.org/web/web.php">Wayback Machine.</a></li>
<li>Take a gander at <a href="http://archive-it.org/">Archive-It</a> and one of its many exhibits.</li>
<li>Or just take a look at all of the Internet Archive’s many <a href="http://archive.org/projects/">wonderful projects</a>.</li>
<li>Also, take a look at the Library of <a href="http://lcweb2.loc.gov/diglib/lcwa/html/lcwa-home.html">Congress’ Minerva collection</a>.</li>
<li>Or, check out the <a href="http://www.webarchive.org.uk/ukwa/">British Web Archive’s</a> many exhibits from across the pond.</li>
<li>Or, see how they do web archives down under, by looking at <a href="http://pandora.nla.gov.au/">Australia’s web archive Pandora</a>.</li>
<li>Or, take a look at <a href="http://www.webarchiv.cz/">the Czech Republic’s</a> web archive.</li>
<li>Finally, take a moment to view <a href="http://arquivo.pt/?l=pt">the Portuguese</a> web archive.</li>
<li>Organize an event with a local club to increase awareness of digital preservation initiatives. Remember to serve cake, people will come if there is cake.</li>
<li>Attend a conference, or lecture discussing digital preservation. There is sometimes cake.</li>
<li>Distribute digital preservation leaflets, or guides. Some are available <a href="http://www.digitalpreservation.gov/personalarchiving/documents/PA_All_brochure.pdf">here</a>.</li>
<li>Eat strawberry ice cream with sprinkles, hot fudge, and a maraschino cherry on top. It doesn’t really have anything to do with digital preservation, but if you made it this far down the list, you deserve a treat. Also, this is the best combination of ice cream flavors…according to science.</li>
<li>Watch one of these fantastic short videos on digital preservation and root <a href="http://www.youtube.com/user/wepreserve?feature=watch">for Digiman</a>!</li>
<li>Check your personal digital archiving knowledge by taking <a href="http://blogs.loc.gov/digitalpreservation/2013/02/personal-digital-archiving-how-well-do-you-score/">this fun quiz</a>.</li>
<li>Didn’t do so well? Try giving yourself <a href="http://blogs.loc.gov/digitalpreservation/2013/03/death-taxes-digital-audits-and-puppies/">a personal digital archiving audit</a> (bonus, there are pictures of puppies.)</li>
<li>Take a look at these <a href="http://www.digitalpreservation.gov/personalarchiving/photos.html">personal digital archiving tips.</a></li>
</ol>
<p>And of course:  48. Identify;  49. Decide;  50. Organize;  51. Make Copies.</p>
<p>Oh dear, that&#8217;s 51. Sorry, I got carried away with my enthusiasm for the subject. Anyway, this list should keep you busy until next year’s preservation week. Have fun and let us know if there are other things you think should be on the list.</p>
<p>Until next time, I wish you all safe data.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/fifty-digital-preservation-activities-you-can-do/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Ian MacKaye and Citizen Archiving</title>
		<link>http://blogs.loc.gov/digitalpreservation/2013/05/ian-mackaye-and-citizen-archiving/</link>
		<comments>http://blogs.loc.gov/digitalpreservation/2013/05/ian-mackaye-and-citizen-archiving/#comments</comments>
		<pubDate>Wed, 08 May 2013 17:08:02 +0000</pubDate>
		<dc:creator>Butch Lazorchak</dc:creator>
				<category><![CDATA[Outreach and Events]]></category>
		<category><![CDATA[Personal Archiving]]></category>
		<category><![CDATA[collecting]]></category>
		<category><![CDATA[Dischord]]></category>
		<category><![CDATA[Fugazi]]></category>
		<category><![CDATA[Ian MacKaye]]></category>
		<category><![CDATA[Minor Threat]]></category>
		<category><![CDATA[music]]></category>
		<category><![CDATA[personal archiving]]></category>
		<category><![CDATA[punk rock]]></category>

		<guid isPermaLink="false">http://blogs.loc.gov/digitalpreservation/?p=14564</guid>
		<description><![CDATA[I think of “citizen archivists” as the first responders of history, arriving early on the scene to gather, capture, describe and preserve ephemeral artifacts of interest and helping to ensure that they survive over time to share with the future. Thoughts on citizen archivists and their importance to institutions like ours were running through my [...]]]></description>
			<content:encoded><![CDATA[<p>I think of “citizen archivists” as the first responders of history, arriving early on the scene to gather, capture, describe and preserve ephemeral artifacts of interest and helping to ensure that they survive over time to share with the future.</p>
<div id="attachment_14566" class="wp-caption alignleft" style="width: 310px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/mackaye-johnston1.jpg"><img class="size-medium wp-image-14566" title="mackaye-johnston1" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/mackaye-johnston1-300x225.jpg" alt="Ian MacKaye of Dischord Records speaking at the Library of Congress. Photo credit: Leslie Johnston" width="300" height="225" /></a><p class="wp-caption-text">Ian MacKaye of Dischord Records speaking at the Library of Congress. Photo credit: Leslie Johnston</p></div>
<p>Thoughts on citizen archivists and their importance to institutions like ours were running through my head last night as <a href="http://www.loc.gov/today/pr/2013/13-080.html">we hosted</a> Ian MacKaye of the independent label <a href="http://www.dischord.com/">Dischord Records</a> in a presentation here at the Library.</p>
<p>MacKaye founded Dischord as a teenager with partner Jeff Nelson in 1980, and he and Nelson went on to form Minor Threat, a group that, along with Bad Brains, has been credited with introducing the <a href="http://en.wikipedia.org/w/index.php?title=Hardcore_punk&amp;oldid=553996712#Washington.2C_D.C.">DC hardcore ethic</a> to an audience beyond Washington.</p>
<p>As a performer, producer and enthusiastic supporter, MacKaye has documented music coming out of the Washington, D.C. underground for the past 30 years. Much of the music and art he’s compiled is not being comprehensively addressed by major collecting institutions, but that’s not unusual.</p>
<p>There’s often a gap between an activity and the entrance of its artifacts into the halls of culture, even while the material may have <a href="http://www.nyu.edu/pubs/counterblast/punk.htm">long demonstrated cultural and economic merit</a>. There are complexities to how popular cultural and folk arts ultimately get embraced by cultural heritage organizations, but that gap in time is a key concern to the digital stewardship community. The concern is that valuable materials will be lost through the ravages of time without intervention, supporting the idea that we need to creatively engage with citizen archivists to help identify important materials early in their lifecycle and to assist in their long-term care.</p>
<p>Richard Cox touches on these issues in an interesting 2009 paper, <a href="http://d-scholarship.pitt.edu/2692/">Digital Curation and the Citizen Archivist</a>. In it he provides a conceptual argument for how the current interest in <a href="http://blogs.loc.gov/digitalpreservation/2011/08/digital-preservation-digital-curation-digital-stewardship-what%e2%80%99s-in-some-names/">digital curation</a> might guide professional archivists as they embrace partnerships with private collectors to capture materials while concurrently developing training programs to assist private citizens in how to preserve, manage and use digital personal and family archives.</p>
<p>The evolving nature of personal digital archiving is described succinctly in this passage:</p>
<blockquote><p>In the past, while there has always been tension between private and public (institutional) collectors, it has been the institutional collectors –archives, libraries, museums, and historic sites – that have won out. In the future, there may be less certainty about this, especially as so many personal papers are digitally born and pose challenges to the public archives. The good news is, however, many private citizens care as passionately about the documents as do the institutional repositories.</p></blockquote>
<p>As Cox notes, “personal collecting can seem quirky or frivolous, but it always reveals some deeper inner meaning to life’s purpose.” The “citizen archivist” takes collecting to another level by engaging with materials in a deeper, more systematic way.</p>
<p>We’ve tried to address the need for guidance with our <a href="http://www.digitalpreservation.gov/personalarchiving/">personal digital archiving</a> materials and the National Archives has a <a href="http://www.archives.gov/citizen-archivist/">program for engaging with citizen archivists</a>, but more work needs to be done, along the lines of what Cox is exploring, on how to help train the people who, as private citizens, will be caring for some portion of our future digital collections, whether they recognize that they’re &#8220;citizen archivists” or not.</p>
<p>MacKaye doesn’t really consider himself a citizen archivist, but the work that he’s been doing over the past thirty years goes beyond the level of mere collecting and provides models for creative ways to gather and provide access to archival materials.</p>
<div id="attachment_14567" class="wp-caption alignleft" style="width: 303px"><a href="http://blogs.loc.gov/digitalpreservation/files/2013/05/Fugazi-Live-Series_-dallas-page-for-blog.png"><img class="size-medium wp-image-14567" title="Fugazi Live Series_ dallas-page-for-blog" src="http://blogs.loc.gov/digitalpreservation/files/2013/05/Fugazi-Live-Series_-dallas-page-for-blog-293x300.png" alt="Screenshot from the Fugazi Live Series of a Notorious Dallas Concert in 1990." width="293" height="300" /></a><p class="wp-caption-text">Screenshot from the Fugazi Live Series of a Notorious Dallas Concert in 1990.</p></div>
<p>After his time in Minor Threat he helped form the band Fugazi, who released seven albums and played more than 1,000 concerts in 50 states and several foreign countries between 1986 and 2003. The band’s sound engineers recorded more than 800 of these shows.</p>
<p>“I’d say it was for posterity, but to what end, we had no idea,” Mr. MacKaye said in a <a href="http://www.nytimes.com/2011/11/26/arts/music/fugazi-live-series-a-post-punk-bands-archive-of-shows.html?_r=0">New York Times article</a> in 2011. “As with a lot of collections, once we had a couple hundred tapes, we just continued to amass them. Why stop? We’d already gotten this far.”</p>
<p>Almost any collection becomes interesting once you <a href="http://www.neatorama.com/2008/05/14/neatoramas-guide-to-25-of-the-strangest-collections-on-the-web/">get enough stuff in one place,</a> but there was obvious cultural and historic value to a collection of Fugazi live concerts. The label started the Fugazi Live Series in 2004 and moved it to the web in 2011 to eventually make available a complete archive of the Fugazi concert experience.</p>
<p>One of the many interesting things about the Fugazi live series is how the label designed the web site to bring the audience into the equation, supporting user submissions of photographs, fliers and other ephemera related to each event, deepening the engagement while offering an opportunity for the audience to share their personal experiences and stories.</p>
<p>Citizen archivist’s proximity to events also allows them to be more proactive about making materials available. As <a href="http://www.spin.com/articles/fugazi-ian-mackaye-library-of-congress-lecture-punk-archive/">MacKaye said last night</a>, “somewhere down the road, some kid very much like me will be interested in what was happening during this time. Because, most of the time, what was happening in the past has always been curated by the mainstream media industry. They&#8217;re the ones that decided about the history of rock.&#8221;</p>
<p>Whether we recognize it or not, we’re surrounded by citizen archivists. The challenge for us in the information professions is to find creative ways to support and channel the wonderful energy going on in the wider world to ensure that diverse materials of incredible value survive over time.</p>
<p>P.S. There are more photographs of the event on our <a href="https://www.facebook.com/media/set/?set=a.600330936643670.1073741826.116070868403015&amp;type=1">Facebook page</a> and we welcome your submissions. The event was recorded and will be made available for webcast shortly. Follow our <a href="https://twitter.com/ndiipp">Twitter feed</a> for the latest information.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.loc.gov/digitalpreservation/2013/05/ian-mackaye-and-citizen-archiving/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
	</channel>
</rss>
