Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Could they not run webhooks where my site can call to them when to do a full or partial crawl? PubSub for web crawling, if you will.


Sitemaps (http://www.sitemaps.org/) help a bit in that regard, as crawlers can check the sitemap and only crawl updated content.


Thanks.


PUsH: http://en.wikipedia.org/wiki/PubSubHubbub

Google is the only crawler I see using the protocol on my site. It does make Google updates occur within minutes, so that alone is reason to implement push.


I thought this was only being used for RSS - interesting!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: