Top of page

Category: Open Data

Bar graph and line graph

New U.S. Elections Web Archive Data Resources Available

Posted by: Tracee Haupt

This blog post was guest-authored by Rachel Trent, Senior Digital Collections Data Librarian. For nearly twenty-five years, the Library of Congress has been archiving campaign websites for Presidential, Congressional, and gubernatorial elections. Back in 2022, we released a dataset of index files for the United States Elections Web Archive, and we are happy to announce …

An abstract design of dots and the name of the event LC Collections as Data Concluding Computing Cultural Heritage in the Cloud

Computational Approaches to Library of Congress Collections as Data – Concluding the CCHC initiative

Posted by: Laurie Allen

Please join us as we conclude the Computing Cultural Heritage in the Cloud (CCHC) grant, awarded to the Library of Congress in 2019 by the Mellon Foundation. At the event, we will describe the lessons of the grant, designed to help us investigate a model for enabling discovery, investigation, and visualization of Library materials in …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Candidates, Campaigns, and CDX Files: A New United States Elections Web Archive Dataset

Posted by: Tracee Haupt

This dataset has been re-released as a data package on data.labs.loc.gov/packages. Please see the 2024 blog post announcing its release for more information and updated links.   This blog post was co-authored by Chase Dooley (Senior Digital Collections Specialist) and Tracee Haupt (Digital Collections Specialist), members of the Library’s Web Archiving Team. The Library’s Web …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Computing Cultural Heritage in the Cloud: An Interview with Victoria Scheppele

Posted by: Leah Weinryb-Grohsgal

We are delighted to introduce Victoria (Tori) Scheppele, a Library Technician in the Prints & Photographs Division who has joined us temporarily to work on the Computing Cultural Heritage in the Cloud (CCHC) initiative. The CCHC initiative is supported by a generous grant from the Andrew W. Mellon Foundation. Centered in LC Labs, the project …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

It’s a bird, it’s a plane, it’s a…derivative dataset!

Posted by: Eileen J. Manchester

This post describes a collaboration between LC Labs member Eileen J. Manchester and Peter DeCraene, the Albert Einstein Distinguished Educator Fellow to answer the question: "what would it mean to treat a dataset as a primary source?"

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Reflecting On a Year of Selected Datasets

Posted by: Pedro Gonzalez-Fernandez

Introduction The Selected Datasets Collection was publicly launched June 2020 as part of the Library’s ongoing efforts to support emerging data-driven styles of research. Since then, our initial offering of twenty datasets has grown to nearly 200 unique items, and we’ve continued to refine the technical workflows by which content is prepared and delivered to …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Sparking the Datamagination: 2021 Digital Strategy Summer Intern Design Sprint part II

Posted by: Eileen J. Manchester

This is an interview with Maria Capecchi, Abigail Tick, and Joshua Ortiz Baco, three of the seven students that joined our team during the summer of 2021. As a small group, they worked together to better understand the Newspaper Navigator data set with the needs of undergraduate students in mind.