
Thanks! I've heard of Shpider, but I haven't needed it yet because I'm not doing programmatic web browsing (just downloading a page and extracting info).

I'll see if I can clean up the markup enough to get it to parse with hxt; otherwise, Shpider provides a good reference on how to use Tagsoup correctly. The Shpider codebase is also really clean and well documented.



