Hacker Newsnew | past | comments | ask | show | jobs | submit | more unicornporn's commentslogin

> That would allow constant archival of every webpage a user ever visits -- an immutable record over the user's years of crawling the web.

This is usually solved by using a proxy: http://netpreserve.org/projects/live-archiving-http-proxy/


Can this be combined with webrecorder? Does anyone know someone who's done this? I only use about 30GB of traffic a month so a 2TB $70 hard drive would last me almost six years.


https://github.com/webrecorder/webrecorder can be run using Docker. There's also plenty of Proxys that can save your browsing. See: http://netpreserve.org/projects/live-archiving-http-proxy/


In what way could this considered to be “your own internet archive”? I see no way to register a user and save pages to a collection.

If you really want to create your own archive, set up a Live Archiving HTTP Proxy[1], run SquidMan [2] or check out WWWOFFLE[3].

If you want something simpler, have a look at Webrecorder[4] or a paid Pinboard account with the “Bookmark Archive”[5].

[1] http://netpreserve.org/projects/live-archiving-http-proxy/

[2] http://squidman.net/squidman/index.html

[3] http://www.gedanken.org.uk/software/wwwoffle/

[4] https://webrecorder.io/

[5] https://pinboard.in/upgrade/


Great points.

You're right, for now it's a single rate-limited HTML form and you'll have to manually collate the links to the archives you create. I'll be adding specialty features (with accounts) next. :)


Another pair of even simpler solutions:

Print and store pages as PDFs.

Download and save entire pages as webarchives (Safari, wget)


I was about to switch to this about a month ago but realized WSL does not work with LTSB. So no BASH for Windows. It will probably be more than a year too...


The Bash that comes with Git (which uses msys AFAIK) works pretty well.


It uses MSYS2 actually, which is a completely different project.


To expand on that: MSYS was a fork of Cygwin 1.3, and MSYS2 is a fork of Cygwin 1.7. (Babun is a distro of Cygwin, and Git-bash is a distro of MSYS2.)


Can it run executables compiled for Ubuntu?


I'd be surprised if WSL ever made it to LTSB. Especially if you read the article and MS's definitive of what the intended use case is.


Is Cygwin still maintained? It was the first thing I installed on Windows going back over a decade.


Yes, but I prefer MSYS2 these days. It's a lot like Cygwin, but uses pacman as a package manager instead of Cygwin's setup.exe.


Zero Books: Advancing Conversations:

http://zero-books.net/blogs/zero/feed/podcast/

Team Human:

https://teamhuman.fm

Internet History Podcast:

http://www.internethistorypodcast.com

Featured Voices:

http://www.peakprosperity.com


> I have a theory that women are less likely to be, I guess the word would be "assertive", than men.

That's a descriptive, not necessarily a normative, statement. What's more interesting to me is why women and men act differently. Do you believe the differences could be the result of child rearing, social climate and norms etc?


Facebook posts are more or less unreadable to me without a Facebook login. This is what it looks like to me: https://i.imgur.com/ZQNexgi.png

Does anybody else outside Facebook space find it annoying?


Yup, and not because I don't have a Facebook, only because I'm not logged in on my work machine.


Yes, I'm amazed that the CDs I ripped to FLAC 14 years ago still don't need conversion to a new format to play in all the software I use. I don't think I expected that when I ripped them.

It's important to understand that it's not only compression that gives FLAC an edge over AIFF/WAVE. Vorbis comments are an excellent way to save metadata in audio files.


If you want metadata, put it in a RIFF chunk.


What (and when something) ends up on the first page never ceases to surprise. I've used this I don't know how long. Could it be 15 years? Their official tagging client (Picard) is OK, but I prefer tagging using Mp3tag and the MusicBrainz database.


Seconded. mp3tag and the MusicBrainz database are marvelous. I used to have a music collection of 25.000+ tracks, and once I discovered aforementioned tools it was an absolute blessing, especially for tracks that that had poor (or non-existant!) tags.

I'm fully into Spotify now, minus the 1000 or so tracks I couldn't match, but damn, does talking about this take me back. Lugging around my 160Gb iPod Classic and still not being able to fit all my music. UT2004 instagib & Counter Strike LAN parties and swapping entire media libraries.. movies and series included. It was a fun time :)


I upvote many things that I already know of to let others become aware of them. It's a bit like xkcd #1053 [1], when I encourage others to read about something, they might then write comments about that thing offering new insights etc. that is of value to me. Additionally, them having become aware of that thing now might lead to them caring about that thing and doing something related to that thing that is of value to me in the future. But really my primary motivation is simply that I think that things I like deserve that I upvote them so that others can enjoy those things too regardless of future utility to me.

[1]: https://xkcd.com/1053/


Try Jaikoz (http://www.jthink.net/jaikoz/). It's Shareware but very capable. Uses MusicBrainz and Discogs, fingerprinting and whatever.


I fear now an entire generation (Gen Z) grows up with services like Youtube, Musical.ly, Spotify and iCloud/GoogleCloud for their photos.

They never interact with images and audio files, they don't know about metadata at all. They don't use notebook or a PC. They are vendor locked to iOS or Android. They are not dumb, but less and less have the initiative search around and try out new things outside the box. They stay inside their apps, they don't know the vast web outside that can be searched with Google search engine. (it depends on parents and schools to inspire them to try out more)


Aww piffle. :)

It's not like the previous generation of people all explored the vast web. They didn't. They didn't even use it until relatively recently.

The percentage of those that are intrigued by technology and have the wanderlust to explore the digital landscape are probably exactly the same (perhaps more now) as the previous generation. Those people that you refer to as "vendor locked" now would never have even used computers in the past generations, or used them only for Office apps.


That's actually what we do in some schools of zen when we do zazen.

We begin to steady and stabilize the mind by counting the breath. We practice by counting each inhalation and each exhalation, beginning with one and counting up to ten. Inhale — at the end of the inhalation, count one. Exhale — at the end of the exhalation, count two. When you get to ten, come back to one and start all over. The only agreement that you make with yourself in this process is that if your mind begins to wander — if you become aware that what you’re doing is chasing thoughts – you will look at the thought, acknowledge it, and then deliberately and consciously let it go and begin the count again at one.

It eventually stills the mind and will help you, or at least me (the few times I have trouble falling asleep), to wander into zzzzz land.


> It eventually stills the mind and will help you

I does still the mind, but I have a problem that it makes me even more awake. A couple of times I tried that I ended up going back to my laptop and working through the night because I felt like I gained so much energy.


> I does still the mind, but I have a problem that it makes me even more awake.

Then skip the counting and just focus on your breathing (shikantaza). If that also is too much to focus on, do it for 20 minutes or so and then just try to sleep the regular way. Hopefully your brain wont be monkey minding as much as it did before the excercise.

I want to stress that this is not zazen — when doing zazen you should not fall asleep — but a method that I think can be useful also when we try to fall asleep.


That is a clear sign that you're overworking your brain.


Yes - I've come across the same breathing suggestions via Headspace.


Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: