A detailed analysis of the Copyright records has been completed, and test scanning has been done to determine the best digitization parameters for the several formats of the records. For optimal preservation, the records will be scanned in uncompressed tagged image file format (TIFF) at a minimum of 300 pixels per inch (ppi) in 24 bit color. For routine access to the digitized records, derivative files will be created in high quality JPEG and JPEG2000 format at 50:1 compression. Production scanning has begun on three sets of records.
First, the 2.5 million catalog cards which constitute the indexes to assignments and transfers of copyrights from 1870 to 1977 have been digitized and the images placed in archival storage in our data center and at a backup site. Four Metadata Specialists have been working part-time to capture index terms from the assignment and transfer card images in local databases as a prototype for capturing data to create a publicly available online index to the digitized records. Data has been captured from more than 126 thousand images.
Second, digitization of the 7.7 million registration catalog cards from the period 1971 to 1977 has been completed, and over 2.4 million cards have been digitized so far from the 1955 to 1970 period. Work is continuing on the card scanning.
Third, the 660 bound volumes of the Catalogs of Copyright Entries are being scanned at the Internet Archive center in the Library of Congress Adams Building. This is the same center and process being used to scan works from the Library’s collections. 417 volumes have been digitized so far including registrations and renewals from 1936 to 1977. They are available at http://www.archive.org/details/copyrightrecords/ with a limited search capability based on the results of optical character recognition (OCR) of the scanned text. This work will require another year to complete.
The focus now is on how to index the records and make them widely available via the web. This is the central purpose of this blog. I and others will periodically post information about what we’re working on and seek input from you about how you think we should proceed. Your input is very important to us so that we can build the very best system to meet your needs for copyright information. If you have interest in copyrights and in seeing the records made more accessible, then please follow our posts and provide us with your feedback and comments.