Hacker News new | past | comments | ask | show | jobs | submit login

Two big problems with automated spam blocking are: false positives and changing domain names.

For the second one, how often do you revise your blocked links? what if it changed owner and the new one doesn't provide spam.

For the first one, is even one false positive tolerable? Will you deny someone presence in your index because you failed? And if so, how do you handle challenges?




We don't mark a domain as spam until many of the pages we've seen look spammy.

Our ideal is to recrawl everything every 14 days, but during our launch we have not been achieving that.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: