Shiny new torrents are up!!
I didn't expect torrents to go from a pirate's best friend to resisting fascism so abruptly... but here we are. #fascism #datapreservation #dataprotection
@hacks4pancakes perhaps something for #DataHoarders https://www.tomshardware.com/tech-industry/big-tech/data-hoarders-race-to-preserve-data-from-rapidly-disappearing-u-s-federal-websites and others: https://connect.oeglobal.org/t/webinar-federal-data-disappearing-and-who-is-saving-it/7356 @SafeguardingResearch
NEW: Director of Wayback Machine, Mark Graham @mark dives into the End of Term @internetarchive campaign; reveals how researchers & journalists fight to preserve government data before it disappears.
#DataPreservation #journalism
EDIT: EMERGENCY! SOME OF THE DATA SETS FROM ARCHIVE.ORG ARE NOW LOCKED. DOWNLOAD THESE NOW!
My US government data hoarding page is up and ready with links and torrents. The torrents are all being seeded by my junkbox torrent server. I will continue to add torrents as I download things.
A 16TB collection of federal public datasets linked by data.gov, including over 311,000 datasets harvested during 2024 and 2025, is now part of a new data vault project seeking to preserve vital research data.
https://lil.law.harvard.edu/blog/2025/02/06/announcing-data-gov-archive/
Due to the current data preservation emergency, I'm pretty sure I'm going to find at least 20 terabytes to stuff into an old 8-drive tower that I'm building out of junkbox parts. Then I will host mirrors of the government data that the fascists have wiped out. #datahoarder #datapreservation #fascists
Are you a fellow data hoarder? Have some spare terabytes? Start here:
https://commoncrawl.org/blog/january-2025-crawl-archive-now-available
https://meta.wikimedia.org/wiki/Data_dump_torrents#English_Wikipedia
https://github.com/end-of-term/eot2024
https://github.com/internetarchive/dweb-mirror
https://archive.org/details/20250128-cdc-datasets
Is anyone out there creating an archive of the EPA’s website? There is a lot of valuable information that could be lost if someone were to do something stupid like, say, remove every mention of climate change.
@mia is up next on integrating enriched metadata into collections platform. "Have we looked after the data we asked people to contribute?"
Me: this sounds like a call for #DigitalPreservation #DigiPres #DataPreservation
There does not seem to be an #AccurateRip compatible CD ripper for headless #Linux machines
I know we're at the end of the era of the #AudioCD, but who is doing preservation here?
@nytpu I think that's more businesses being filled with idiots that don't care about #archival and #DataPreservation.
"Worth" shouldn't really be a factor at all.
If I had my way, governments would invest a *lot* more in those multi-century-lifespan archival optical discs and try to make that petabyte disc (https://en.wikipedia.org/wiki/Hyper_CD-ROM) a thing, so that long-term archival in government-provided facilities, similar to the national archives that already exist, would become vastly more widespread.