Opening Up the National Digital Newspaper Program

The following is a guest post by David Brunton, a Supervisory Information Technology Specialist in the Library of Congress Office of Strategic Initiatives.

The National Endowment for the Humanities and the Library of Congress have partnered to enhance access to historic newspapers for many years with the National Digital Newspaper Program.  A centerpiece of this partnership is the Chronicling America website.  At over six million pages from over thirty states, the program meets this commitment by publishing historic newspapers on the web.

The software that runs this centerpiece is developed in the Library of Congress’s Repository Development Center, and it is called chronam.  It is available for anyone to use: http://github.com/LibraryofCongress/chronam/. From the project README:

“The idea of making chronam available here on Github is to provide a technical option to these awardees, or other interested parties who want to make their own websites of NDNP newspaper content available.”

Around this release, we added a large number of features, and fixed some bugs as well:

  • look and feel can be easily customized
  • database size has been decreased by over 90%
  • search URLs are more cache-friendly
  • word coordinates are saved to the filesystem and delivered compressed
  • much, much more

The customizability is illustrated with the two side-by-side screenshots requiring only a single line change in a configuration file.  On the left is our default for the Library of Congress website, and on the right is a generic view without any Library of Congress branding.

Click to enlarge.

Click to enlarge.

 

 

 

 

 

 

 

 

 

We created a public mailing list, for talking about the software, and we began to publicize our work with the NDNP awardees.  We are now sharing it more widely, in the hopes of furthering the mission to enhance access to historic newspapers.

4 Comments

  1. Janell
    April 18, 2013 at 7:20 am

    Can this be used by other organizations who are not NDNP awardees?

  2. Janell
    April 18, 2013 at 7:26 am

    To clarify, it sounds like it’s open to all, but then I’m not sure what you mean by “NDNP newspaper content” in the quote above.

  3. David Brunton
    April 18, 2013 at 8:58 pm

    Janell, it is open to all who want to use it. That includes, interestingly enough, not only the software, but also the content!

  4. margaret scuderi
    March 6, 2014 at 8:02 am

    does not allow for a name search- john smith will present every john and every smith in the paper which is USELESS FOR RESEARCH- Northen NY library had a wonderful site that allowed “john smith” and that is what you got- by using the ” x” you got the first and last name of the person- the northen group joined a group 3Rs and they go thru Library of Congress- and I was told that is unfortunately the way it works-also- the site is very slow to populate as one must bring up the page of the paper and wait to even read the name- this can be fixed if tax dollars fund this site make it usable

Add a Comment

This blog is governed by the general rules of respectful civil discourse. You are fully responsible for everything that you post. The content of all comments is released into the public domain unless clearly stated otherwise. The Library of Congress does not control the content posted. Nevertheless, the Library of Congress may monitor any user-generated content as it chooses and reserves the right to remove content for any reason whatever, without consent. Gratuitous links to sites are viewed as spam and may result in removed comments. We further reserve the right, in our sole discretion, to remove a user's privilege to post content on the Library site. Read our Comment and Posting Policy.

Required fields are indicated with an * asterisk.