Top of page

Category: Open Data

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Candidates, Campaigns, and CDX Files: A New United States Elections Web Archive Dataset

Posted by: Tracee Haupt

This blog post was co-authored by Chase Dooley (Senior Digital Collections Specialist) and Tracee Haupt (Digital Collections Specialist), members of the Library’s Web Archiving Team. The Library’s Web Archiving Team recently released a derivative dataset that describes the United States Elections Web Archive, a collection that preserves over twenty years of campaign websites for candidates …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Computing Cultural Heritage in the Cloud: An Interview with Victoria Scheppele

Posted by: Leah Weinryb-Grohsgal

We are delighted to introduce Victoria (Tori) Scheppele, a Library Technician in the Prints & Photographs Division who has joined us temporarily to work on the Computing Cultural Heritage in the Cloud (CCHC) initiative. The CCHC initiative is supported by a generous grant from the Andrew W. Mellon Foundation. Centered in LC Labs, the project …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

It’s a bird, it’s a plane, it’s a…derivative dataset!

Posted by: Eileen J. Manchester

This post describes a collaboration between LC Labs member Eileen J. Manchester and Peter DeCraene, the Albert Einstein Distinguished Educator Fellow to answer the question: "what would it mean to treat a dataset as a primary source?"

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Reflecting On a Year of Selected Datasets

Posted by: Pedro Gonzalez-Fernandez

Introduction The Selected Datasets Collection was publicly launched June 2020 as part of the Library’s ongoing efforts to support emerging data-driven styles of research. Since then, our initial offering of twenty datasets has grown to nearly 200 unique items, and we’ve continued to refine the technical workflows by which content is prepared and delivered to …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Sparking the Datamagination: 2021 Digital Strategy Summer Intern Design Sprint part II

Posted by: Eileen J. Manchester

This is an interview with Maria Capecchi, Abigail Tick, and Joshua Ortiz Baco, three of the seven students that joined our team during the summer of 2021. As a small group, they worked together to better understand the Newspaper Navigator data set with the needs of undergraduate students in mind.

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Diving into Branch Rickey: Using a dataset of crowdsourced transcriptions as a tool for open research

Posted by: Carlyn Osborn

Today’s blog post is from Abby Shelton and Lauren Seroka, two Digital Collections Specialists in the Digital Content Management Section here at the Library of Congress. Abby and Lauren discuss their work with the University of Michigan School of Information’s Ann Arbor Data Dive earlier this year. On March 27, 1956, Branch Rickey wrote of baseball …