Friends, data wranglers, lend me your ears; The Library of Congress’ Selected Datasets Collection is now live! You can now download datasets of the Simple English Wikipedia, the Atlas of Historical County Boundaries, sports economic data, half a million emails from Enron, and urban soil lead abatement from this online collection. This initial set of …
This is a guest post by Jennifer “JJ” Harbster, Head of the Science Reference Section in the Library’s Science, Technology and Business Division. She had her first taste of web archiving with the Internet Archive’s collaborative project documenting Hurricane Katrina and went on to lead the Science Blogs Web Archive. On April 22, 2020 we …
This is a guest post by Kristy Darby, a Digital Collections Specialist in the Digital Content Management Section in Library Services. We are excited to share that anyone anywhere can now access a growing online collection of contemporary open access eBooks from the Library of Congress website. For example, you can now directly access books …
It has been just over a year since we kicked off a deep dive into the Library of Congress Web Archives on the Signal! Now at over 2 petabytes, the web archives are a complex aggregation of interrelated web objects that make up the internet as we know it (images, text, code, audio, video, etc.). …