Search results for: datasets

A New Resource to Explore Library of Congress Transcription Datasets

Posted by: Carlyn Osborn

Today’s guest post is from Madeline Goebel, a Digital Collections Specialist at the Library of Congress. As a reader of the Signal, you may already be familiar with By the People, the Library of Congress’s crowdsourcing program that allows volunteers to transcribe, review, and tag digitized pages from the Library’s collections. Further, you may already know …

Reflecting On a Year of Selected Datasets

Posted by: Pedro Gonzalez-Fernandez

Introduction: The Selected Datasets Collection was publicly launched in June 2020 as part of the Library’s ongoing efforts to support emerging data-driven styles of research. Since then, our initial offering of twenty datasets has grown to nearly 200 unique items, and we’ve continued to refine the technical workflows by which content is prepared and delivered to …

Selected Datasets: A New Library of Congress Collection

Posted by: Pedro Gonzalez-Fernandez

Friends, data wranglers, lend me your ears; the Library of Congress’ Selected Datasets Collection is now live! You can now download datasets of the Simple English Wikipedia, the Atlas of Historical County Boundaries, sports economic data, half a million emails from Enron, and urban soil lead abatement from this online collection. This initial set of …

What’s New Online at the Library of Congress: April 2024

Posted by: Carlyn Osborn

Interested in learning more about what’s new in the Library of Congress’ digital collections? The Signal shares updates on new additions to our digital collections and we love showing off all the hard work of our colleagues from across the Library. Read on for a sample of what’s been added recently and some of our favorite highlights. Click here for …

A Picture is Worth a Thousand Data Points: Visualizing File Format Data

Posted by: Liz Holdzkom

Technologist Ashley Blewer is using the format description document (FDD) XML files to gather data across all FDDs. She has pulled this data into visualization tools that show which categories our FDDs fall into and how many FDDs are updated each year, by category. These visualizations help the formats team identify issues and streamline internal review and update processes.

What’s New Online at the Library of Congress: February 2024

Posted by: Carlyn Osborn

Interested in learning more about what’s new in the Library of Congress’ digital collections? The Signal shares updates on new additions to our digital collections and we love showing off all the hard work of our colleagues from across the Library. Read on for a sample of what’s been added recently and some of our favorite …