cperciva, I really appreciate the effort you put into this.
One thing I find a bit depressing is that even though this scheme hugely reduces the time one spends on HN by filtering the articles according to votes, the top voted articles are not the ones that I would want to read. I usually come here for that "hacker" hacker news (hackerhackernews.com used to be something like this, but now the domain name has expired it seems). Anyway I am sure there will be people who find your service really useful.
That's exactly why I created http://www.hackerblogs.com. If you check the HN front page right now, there are zero posts by hackers; they all come from PR blogs like TechCrunch, ReadWriteWeb, TheNextWeb, etc.
I highly recommend giving it a try. It is still growing, and I hope that someday it may become the new source of real hacker news.
* I dare you to find an article like "Flipping arrows in coBurger King" on the news sites right now.
Nice site, just added my blog. The only problem I see is that if it does keep growing and becomes popular, people could add feeds of irrelevant blogs, and like most social news sites it would turn into a Digg/Reddit.
I tried to register http://deadpanic.com/blog/ , but apparently didn't make the cut. Any suggestions for improvement, or insight into the criteria you use?
Useful web site, added to bookmarks.
But regarding "Flipping arrows in coBurger King": I saw that entry as the first item in the Haskell category, submitted probably shortly after your comment :)
Here is the link: http://www.hackerblogs.com/post/rsfepzwv
Remember that being at the top of the front page at some point during the day does not necessarily mean that an article is in the top 10 links for the day.
I think looking at scores -- without looking at how long it took for those scores to be reached -- probably weights in favour of more "interesting" stories and against link-bait stories, simply because the link-bait tends to accumulate most of its votes very quickly.
Out of curiosity, I just ran the numbers for Love in the age of the pickup artist. Right now, at 15:00, it's in 6th place; I suspect that by the end of the day it will fall past 10th place, since these last 9 hours are probably when the most votes are cast and that link isn't getting many votes any more.
So that story is probably a perfect illustration of when sorting by highest score alone is better than HN's score-per-time measure.
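The contrast between the two rankings can be sketched with toy numbers. This is a minimal illustration only: the gravity formula below is the commonly cited approximation of HN's ranking (points divided by a power of age in hours), and the story data is invented.

```python
# Compare ranking by raw score against a time-decayed "gravity" score.
# The formula is the commonly cited approximation of HN's ranking, not
# necessarily the exact one in use; the stories are made up.

def gravity_rank(points, age_hours, gravity=1.8):
    """Time-decayed score: fast-rising link-bait wins early, then fades."""
    return (points - 1) / (age_hours + 2) ** gravity

stories = [
    # (title, points, age in hours)
    ("link-bait piece", 120, 2),   # accumulated its votes very quickly
    ("slow burner", 150, 20),      # higher total, gathered over a day
]

by_score = sorted(stories, key=lambda s: s[1], reverse=True)
by_gravity = sorted(stories, key=lambda s: gravity_rank(s[1], s[2]), reverse=True)

print([s[0] for s in by_score])    # ['slow burner', 'link-bait piece']
print([s[0] for s in by_gravity])  # ['link-bait piece', 'slow burner']
```

The slow burner wins on raw score while the link-bait wins on score-per-time, which is exactly the divergence described above.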
I think it ranks items only by the votes of people who have had accounts for at least a year, or at least it did when the feature was added. (I may be wrong, or the threshold may have been longer than a year.)
If that is the case, then HN Classic is very good evidence of the change in the submitted stories. I loved seeing the HN Classic homepage and might keep it as my go-to page (if it is still updated).
It's not perfect, certainly -- over the past 3 days I've still only read about half of the top-ten links. This is about avoiding the situation where an interesting highly-voted article gets drowned out by all the noise rather than an attempt to find "good" articles which don't get lots of votes.
I started http://www.hackernewsletter.com/ last week. The articles I select are based on a combination of votes, comments, and items that I found valuable. I received similar feedback from a couple of HN members and thought of splitting up the newsletter based on the article "types". I will be trying that out for the next edition. I think some weeks it will work better than others, since the quantity and quality vary a good bit week to week.
I think this would be great, especially as an option. The posts that generate the most discussion are often interesting, and curiously aren't always the top voted.
That's a lot harder to do -- I can get away with scraping the front page right now because highly voted posts are likely to be on the front page for most of their vote-getting lifetime, but that argument probably doesn't hold about highly commented posts.
Hm, this is true. Perhaps you could scrape /newcomments and keep track of which stories are getting new comments?
This would hinge on the overall comment rate across the site, and on your scraping interval being shorter than the time between the newest comment and the oldest comment displayed on /newcomments.
I think it'd just be really great to have another view into "Most active discussions".
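The idea above could be sketched roughly as follows. Everything here is hypothetical: each scrape is modeled as a list of (comment id, story id) pairs as if already parsed from /newcomments, and the point is that overlapping consecutive scrapes lose no comments as long as the interval is short enough.

```python
# Sketch of tracking "most active discussions" from periodic scrapes
# of /newcomments. Scrapes are invented data, newest comment first.

from collections import Counter

def update_activity(activity, seen, scrape):
    """Count new comments per story; `seen` avoids double-counting
    comments that appear in two overlapping consecutive scrapes."""
    for comment_id, story_id in scrape:
        if comment_id not in seen:
            seen.add(comment_id)
            activity[story_id] += 1

activity, seen = Counter(), set()
# Two overlapping scrapes 5 minutes apart (comment 103 appears in both,
# so it is counted only once).
update_activity(activity, seen, [(103, "story-a"), (102, "story-b"), (101, "story-a")])
update_activity(activity, seen, [(105, "story-a"), (104, "story-c"), (103, "story-a")])

print(activity.most_common())  # story-a leads with 3 new comments
```

If a scrape ever contains no comment from the previous scrape, you know you missed some, which is the failure mode the interval condition above guards against.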
I've taken lots of downvotes lately to say that HN is becoming less hacker and far more generalist.
That happened to virtually every other site: Slashdot, Digg, Reddit. As the site gets attention (because of its core strength of great hacker content) it starts drawing in a non-core audience. Because it's a voting system the non-core articles will start to grow in influence. It loses what drew people in in the first place.
I second this! I would gladly pay 5-10 bucks a month for a daily digest of top hacking-related articles (no business, no politics and no Apple rumours/speculations). HN used to be 80% programming and 20% business-related, now it's the opposite.
Business is a lot wider than startups, though. There has been a huge number of articles about Apple -- I seriously don't understand how such a massive quantity of (often gossipy) articles about the iPad/iPhone and friends got upvoted, especially considering their hacker-unfriendliness. Then also a lot about Google, Facebook, Microsoft, the RIAA, general politics...
I have to say that's surprising. I'm relatively new here, but I thought most people would get here through Paul Graham/YCombinator, which has more to do with business than programming.
Personally, I came here because I would randomly check paulgraham.com ever since his Lisp-related articles started being posted on Slashdot. One day the YC link appeared, which in turn had a link to Hacker News. While I have since gained an appreciation for startup culture (definitely the hacking approach to business), I still primarily hang around for the articles on programming.
If you are looking for something that is 100% programming, then http://news.usethesource.com/news is a good choice. It is built on the same platform as HN.
Disclaimer - I have no affiliation with the amazing peeps at usethesource
This doesn't really solve the problem, though. There's still too much information to manage.
I wonder if any social-links-sites (hacker news, reddit etc.) have tried with an algorithm similar to that which last.fm uses, where you get suggestions on stories based on which previous stories you have shown preference for? (I'm sure this sort of ranking has some fancy name as well)
Reddit certainly attempted this a few years ago. They had a 'recommended' tab in their main navigation. It worked about as well as their search at the time (i.e. completely useless).
If I recall, it was based on your activity (this was pre-subreddits) so if you interacted with lots of political stories, for instance, similar political stories would be suggested.
The problem was if you downmodded a bunch of articles involving Ron Paul, your recommended links would be Ron Paul articles.
The problem with /best in my view is that links gradually come and go -- it's the right format for "what are the recent interesting stories", but it's not the right format for "what are the most interesting stories since the last time I was here".
More evidence that curation, not creation, is the point of sharpest demand in the media business. The immediate challenge, I suppose, is finding the right mix of focus and serendipity.
More broadly, the challenge is having my own life modeled well enough so that information relevant to short, medium, and long terms plans gets reformulated appropriately.
I could see this leading to a point where 'news' is not something I check in the morning over coffee. Rather, it's a feature that presents itself whenever I shift my attention to doing another thing (e.g. working on project A, planning weekend B, etc.)
The really fascinating thing would be getting updates about apparently tangentially related items. It's the classic 'local angle', only with regard to activity, not place.
Thanks for doing this, interesting tool but I won't be using it! Let me explain.
I have often found that the articles I enjoyed most at HN were the outliers, i.e. not the ten most upvoted ones (anecdotal evidence, never tested this quantitatively). In fact this is what makes HN interesting: the quirky entries. My guess is that most of the 10 articles you select will be already covered by other such sites, diminishing the value of coming to HN in the first place.
The end effect will be similar to the Hollywood blockbuster effect. It's not that I don't like going to a blockbuster movie, but I don't want to watch those all the time.
My blog code defaults to only putting one paragraph into the RSS feed -- for most of my blog posts this works well. I've adjusted my script so that future dailies will include the list of links in the RSS feed.
There are a number of article recommendation engines out there that can fill the need for "outlier" articles fitting even the most peculiar tastes. I personally use http://www.euraeka.com and even though it aggregates news from less hardcore programming sources, I find it an incredibly powerful source of science and technology news that fits my taste. I tried the Digg and Reddit recommendation engines, but they all work on user-to-user recommendations, and most of the time I get either inaccurate or trivial results.
you beat me to it! I should have a prototype working by the end of the week for something similar (but hopefully better as well). It was a learning exercise for me so I didn't really lose any time.
Since everyone's ideal is different, probably the most surefire and arguably easiest way to get the particular aggregated and/or curated view you want is to write a small script that does exactly what you want. If others might want the same, make the code available with a link from your HN profile. If a particular service or view hack becomes popular, perhaps PG will add an equivalent feature to HN itself.
This is a pretty cool idea and a perfectly simple implementation. However, I think you're solving a problem that doesn't exist: If I wanted efficiency, I wouldn't be reading blogs and news aggregators in the first place. I come to websites like HN to relax and browse through interesting articles and discussions -- sort of like leafing through a good magazine. If you distill it down to 10 articles then suddenly I'm finished reading and I can get on with my work... Too soon!!!
My bad, I get excessively terse when I'm tired. I wasn't suggesting it out of utility but out of appreciation — nothing says thank you quite like a superfluous vanity domain.
EDIT: Just noticed my other post has garnered at least one down-vote so although I'm pretty sure this idea doesn't have legs, on the off-chance you dear reader are one of ~13 other folks who'd like to see this happen, drop me an email to express your interest.
You might be interested in checking out the page -
http://news.ycombinator.com/best
Sounds a lot like what you're doing. I'm not sure if others know about it already -- I think I stumbled upon it by accident.
Could you make it so that the actual news articles show up in the RSS feed when displayed in Google Reader? I don't want to have to click through to the page and then click the article to actually view it.
The /news page ranks links based on score and time since submission. I'm only ranking links based on score (and whether it has been on a previous daily).
I could scrape the entire site at midnight each day, but I think PG would be very unhappy with me if I did that. Scraping /news every 5 minutes imposes much less load, and since highly ranked links get almost all of their votes before falling off the front page, this gives me almost as much information.
I'd prefer to not post the code publicly, simply because I don't want to encourage people to put extra load on the HN server -- but if you want a copy, send me an email and I'll provide it.
I use FreeBSD's fetch(1) to download the page, but curl or wget would have worked just as well. Extracting the data I want (item #, score, and link) is a few lines of perl. Managing the data over the course of the day and writing out the final HTML is done using standard BSD text utilities (sort, join, comm, cut).
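For anyone curious, the same pipeline can be sketched in Python. To be clear, the markup and regex here are hypothetical guesses, not the author's actual pattern, and the scrape data is invented; the point is the fold that keeps each item's highest observed score over the day, which is what the sort/join/comm step accomplishes with flat files.

```python
# Sketch of the scrape-and-track step. The regex and the sample HTML
# are hypothetical (real HN markup looks different); record_scrape
# keeps the highest score seen per item over the day.

import re

# Hypothetical pattern: an item link immediately followed by "N points".
ITEM_RE = re.compile(r'item\?id=(\d+)[^>]*>(\d+) points')

def parse_front_page(html):
    """Return {item_id: score} for every story found in the page."""
    return {int(i): int(s) for i, s in ITEM_RE.findall(html)}

def record_scrape(day_max, page_scores):
    """Fold one scrape into the running per-item daily maximum."""
    for item, score in page_scores.items():
        day_max[item] = max(day_max.get(item, 0), score)

day_max = {}
# Two invented scrapes taken 5 minutes apart.
record_scrape(day_max, parse_front_page(
    '<a href="item?id=101">40 points</a> <a href="item?id=102">15 points</a>'))
record_scrape(day_max, parse_front_page(
    '<a href="item?id=101">55 points</a> <a href="item?id=103">30 points</a>'))

top = sorted(day_max.items(), key=lambda kv: kv[1], reverse=True)[:10]
print(top)  # [(101, 55), (103, 30), (102, 15)]
```

Since an item keeps its maximum score even after falling off the front page, the end-of-day sort gives nearly the same top 10 as a full midnight crawl would, at a fraction of the load.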
Part of the problem is how Hacker News seemingly has turned into something more akin to plain News. The hacker tidbits are few and far between, and the overall number of new submissions has skyrocketed.