27 points

Not this crap again

permalink
report
reply
19 points

Wtf

permalink
report
reply
47 points

This again??

This time once archive.org is back online again… is it possible to get torrents of some of their popular data storage? For example I wouldn’t imagine their catalog of books with expired copyright to be very big. Would love a community way to keep the data alive if something even worse happens in the future (and their track record isn’t looking good now)

permalink
report
reply
15 points

Like this idea

permalink
report
parent
reply
13 points

Yep, that seems like the ideal decentralized solution. If all the info can be distributed via torrent, anyone with spare disk space can help back up the data and anyone with spare bandwidth can help serve it.

permalink
report
parent
reply
8 points

Most of us can’t afford the sort of disk capacity they use, but it would be really cool if there were a project to give volunteers pieces of the archive so that information was spread out. Then volunteers could specify if they want to contribute a few gigabytes to multiple terabytes of drive space towards the project and the software could send out packets any time the content changes. Hmm this description sounds familiar but I can’t think of what else might be doing something similar – anyone know of anything like that that could be applied to the archive?

permalink
report
parent
reply
5 points

Yeah, the projects I’ve heard about that have done something like this broke it into multiples.

For example, 1000GB could be broken into forty 25GB torrents and within that, you can tell the client to only download some of the files.

At scale, a webpage can show the seed/leach numbers and averages foe each torrent over a time period to give an idea of what is well mirrored and what people can shore up. You could also change which torrent is shown as the top download when people go to the contributor page and say they want to help host it ensuring a better distribution.

permalink
report
parent
reply
0 points

Since I’m spamming with this same idea right now - the description is similar to Freenet (the old one, the Hyphanet), but you’d need some kind of ability to choose parts of which collections of data get stored in your contributed storage, while with Freenet it’s all the network (unless you form a separated F2F net, there is such an option, but no way to be sure that all peers, ahem, store only IA data and not their own porn collections, for example, taking precious storage). I’ve described one idea in my previous comment, but it’s purely an idea, I’m nowhere close to having the knowledge to make such.

permalink
report
parent
reply
1 point

There’s an issue with torrents, only the most popular ones get replicated and the process is manual\social.

Something like Freenet is needed, which automatically “spreads” data over machines contributing storage, but Freenet is an unreliable storage, basically like a cache where older and unwanted stuff gets erased.

So it should be something like Freenet, but possibly with some “clusters” or “communities” with a central (cryptography-enabled) authority of each being able to determine the state of some collection of data as a whole, and pick priorities. My layman’s understanding is that this would be similar to something between Freenet and Ceph, LOL. More like a cluster filesystem spread over many nodes, not like cache.

permalink
report
parent
reply
1 point
*

You have more knowledge on this than I did. I enjoyed reading about Freenet and Ceph. I have dealt with cloud stuff, but not as much on a technical-underpinnings level. My first freenet impression from reading some articles gives me 90s internet vibes based on the common use cases they listed.

I remember ceph because I ended up building it from the AUR once on my weak little personal laptop because it got dropped from some repository or whatever but was still flagged to stay installed. I could have saved myself an hours long build if I had read the release notes.

permalink
report
parent
reply
8 points

I’m pretty sure all their content is available by torrent, so you could mirror the data and provide the torrent files for direct download. It’ll probably be here when it’s back up: https://archive.org/details/public-domain-archive

permalink
report
parent
reply
3 points

Anna’s Archive does this. I think its a really good way to make it difficult to take them down.

Hopefully this hack starts some conversations on how they can ensure longevity for their project. Seems they’re being attacked on multiple fronts now.

permalink
report
parent
reply
107 points

I guess this is an attempt to discredit them.

After working at many, many companies, security is usually very bad. This is typical. Not changing access tokens is also very common.

permalink
report
reply
26 points

Discrediting someone usually has a goal of pushing customers to another source though. There is no other source of this information, so what would be the point?

permalink
report
parent
reply
18 points

Generating turmoil just prior to the USA election maybe?

permalink
report
parent
reply
108 points
*

Destroy a source of historical documents so that the past can be contested. Sow doubt, confusion, deniability. Hide evidence of past crimes, or inconvenient documents. Plant documents, etc.

permalink
report
parent
reply
20 points

Now we are talking.

permalink
report
parent
reply

He who controls the past controls the future, he who controls the present controls the past.

permalink
report
parent
reply
9 points

Russians banned it, russian hackers trying to destroy it, at least it’s consistent

permalink
report
parent
reply
1 point
*

Lol, we should create a society of sorts along the lines of the original Bavarian Illuminati. Create a decentralized storage network and archive of knowledge and history. Create a list of important shit that needs to be archived, and delegate standardized chunks (let’s say 5 or 10gb each chunk) of data that are to be downloaded by people. Anytime 5 or 10 people have downloaded a chunk, strike it off the list of priority archival and move onto the next chunk. For this to work, needs alot of people though.

permalink
report
parent
reply
2 points

Sow doubt. As in spreading it like seeds to take root and grow. 100% in agreement with you, just being a grammar Nazi. Carry on.

permalink
report
parent
reply
1 point

War of attrition is my guess

permalink
report
parent
reply
80 points

Okay, enough is enough. The Internet Archive is both essential infrastructure and irreplaceable historical record; it cannot be allowed to fall. Rather than just hoping the Archive can defend itself, I say It’s time to hunt down and counterattack the scum perpetrating this!

permalink
report
reply
36 points
*

Lol you’re gonna pull that thread and at the end of the sweater is gonna be the CIA or Russia.

Edit: in = is

permalink
report
parent
reply
14 points

Did I stutter?

permalink
report
parent
reply
0 points

Israel more likely. Making an attack completely useless for Palestine and calling yourself a pro-Palestine group - would be exactly their kind of braindead, but capable.

permalink
report
parent
reply
5 points

Mossad…CIA…same dragon different head.

permalink
report
parent
reply
15 points

Where are the anonymous group and 4chan autists? They should attack these assholes. Attacking internet archive is like kicking a kitten. Everyone will hate you for it.

permalink
report
parent
reply

Technology

!technology@lemmy.world

Create post

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


Community stats

  • 18K

    Monthly active users

  • 12K

    Posts

  • 553K

    Comments