Specifically:

https://wiki.archiveteam.org/index.php?title=Restoring

which mentions

https://github.com/hartator/wayback-machine-downloader

and also this tip:

> This is undocumented, but if you retrieve a page with id_ after the datecode, you will get the unmodified original document without all the Wayback scripts, header stuff, and link rewriting. This is useful when restoring one page at a time or when writing a tool to retrieve a site:

> http://web.archive.org/web/20051001001126id_/http://www.arch...
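For anyone scripting this, here is a rough Python sketch of the id_ trick from the quote. The helper names (`raw_snapshot_url`, `to_raw`) and the example.com target are made up for illustration, and since the behaviour is undocumented, treat it as a starting point rather than anything official:

```python
import re

# Prefix taken from the example URL in the quoted tip.
WAYBACK_PREFIX = "http://web.archive.org/web/"


def raw_snapshot_url(timestamp: str, original_url: str) -> str:
    """Build a Wayback URL with id_ appended to the datecode.

    Hypothetical helper: assumes `timestamp` is an all-digit datecode
    such as 20051001001126.
    """
    if not re.fullmatch(r"\d+", timestamp):
        raise ValueError(f"expected a numeric datecode, got {timestamp!r}")
    return f"{WAYBACK_PREFIX}{timestamp}id_/{original_url}"


def to_raw(wayback_url: str) -> str:
    """Insert id_ after the datecode of an existing Wayback URL.

    Hypothetical helper: leaves the URL unchanged if id_ is already there.
    """
    return re.sub(r"(/web/\d+)(?:id_)?/", r"\1id_/", wayback_url, count=1)


if __name__ == "__main__":
    # example.com is a placeholder, not the site from the quoted tip.
    print(raw_snapshot_url("20051001001126", "http://example.com/"))
    print(to_raw("http://web.archive.org/web/20051001001126/http://example.com/"))
```

The resulting URL can then be handed to whatever HTTP client you prefer to fetch the unmodified page.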

According to the downloader's issue tracker, you may need to use a fork instead if you run into errors:

https://github.com/hartator/wayback-machine-downloader/issue...

https://github.com/ShiftaDeband/wayback-machine-downloader



The underscore trick is what I used to revive HgInit (it now lives at hginit.github.io). There weren’t many pages, though, so I did it by hand rather than scripting it.



