So, copyright conditions are apparently another silent killer [1].
A website can be archived, vanish from the web, and then vanish from the archive as well for technical copyright reasons (a new owner's robots.txt file at the root of the domain). So "archiving the archive" might be useful. Or something.
Ezboard was an old discussion site that contained much of interest. It was archived, and now the archive is not accessible.
TIL they still follow robots.txt -- https://blog.archive.org/2017/04/17/robots-txt-meant-for-sea... mentioned that they were planning to stop doing that (and I remember reading a news article based on it claiming that they had already stopped following robots.txt, hence my confusion). Truly a shame. I get following robots.txt at collection time; I don't get following a robots.txt that was added later.
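For anyone unsure what "a robots.txt that was added later" does in practice, here is a rough sketch using Python's standard urllib.robotparser. It only shows how the same URL goes from allowed to blocked once a blanket Disallow rule appears. The is_blocked helper, the example.com URL, and the "ia_archiver" agent name are assumptions for illustration, and this is not a claim about how the archive actually implements its checks.

```python
from urllib.robotparser import RobotFileParser


def is_blocked(robots_txt: str, url: str, agent: str = "ia_archiver") -> bool:
    """Return True if the given robots.txt text disallows `agent` from `url`.

    Hypothetical helper; the default agent name is an assumption.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return not parser.can_fetch(agent, url)


# robots.txt as it might have looked when the pages were captured:
# an empty Disallow means nothing is excluded.
at_capture = "User-agent: *\nDisallow:\n"

# robots.txt a later domain owner might publish: everything is excluded.
added_later = "User-agent: *\nDisallow: /\n"

url = "http://example.com/forum/thread1"  # placeholder URL
print(is_blocked(at_capture, url))
print(is_blocked(added_later, url))
```

If a check like this is applied at playback time rather than only at collection time, old captures disappear from view even though nothing about them changed.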
A while back I tried looking up a particular ezboard forum I used to participate in, only to find it was blocked. This is a real shame. I have some of the posts saved but it's only a fraction of the full forum.
[1] https://archive.org/post/389127/ezboard-content-suddenly-not...