Top of page

Search results for: datasets

A New Resource to Explore Library of Congress Transcription Datasets

Posted by: Carlyn Osborn

Today’s guest post is from Madeline Goebel, a Digital Collections Specialist at the Library of Congress. As a reader of the Signal, you may already be familiar with By the People, the Library of Congress’s crowdsourcing program that allows volunteers to transcribe, review, and tag digitized pages from the Library’s collections. Further, you may already know …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Reflecting On a Year of Selected Datasets

Posted by: Pedro Gonzalez-Fernandez

Introduction The Selected Datasets Collection was publicly launched June 2020 as part of the Library’s ongoing efforts to support emerging data-driven styles of research. Since then, our initial offering of twenty datasets has grown to nearly 200 unique items, and we’ve continued to refine the technical workflows by which content is prepared and delivered to …

Dozens of squares, each with its own individual color or shade, lined up in rows and columns

Selected Datasets: A New Library of Congress Collection

Posted by: Pedro Gonzalez-Fernandez

Friends, data wranglers, lend me your ears; The Library of Congress’ Selected Datasets Collection is now live! You can now download datasets of the Simple English Wikipedia, the Atlas of Historical County Boundaries, sports economic data, half a million emails from Enron, and urban soil lead abatement from this online collection. This initial set of …

Orange and cream image of a tree with different radio call signs.

What’s New Online at the Library of Congress: December 2024

Posted by: Carlyn Osborn

Interested in learning more about what’s new in the Library of Congress’s digital collections? The Signal shares updates on new additions to our digital collections and we love showing off all the hard work of our colleagues from across the Library. Read on for a sample of what’s been added recently and some of our favorite highlights. …

Bar graph and line graph

New U.S. Elections Web Archive Data Resources Available

Posted by: Tracee Haupt

This blog post was guest-authored by Rachel Trent, Senior Digital Collections Data Librarian. For nearly twenty-five years, the Library of Congress has been archiving campaign websites for Presidential, Congressional, and gubernatorial elections. Back in 2022, we released a dataset of index files for the United States Elections Web Archive, and we are happy to announce …

Black and white photograph depicting a woman in a long dress crouching by a wall of card catalog file cabinets in the Library's Main Reading Room. She is pulling open a drawer from one of the cabinets.

Could Artificial Intelligence Help Catalog Thousands of Digital Library Books? An Interview with Abigail Potter and Caroline Saccucci

Posted by: Isabel Brador

Catalog records are key to storing and finding digital library materials. As the volume of digital materials continues to grow rapidly, the Library of Congress is exploring whether AI can help catalogers by automating the generation of metadata. AI could provide an opportunity to speed up description workflows. Yet there are numerous machine learning (ML) …