
Check out pup for parsing HTML. https://github.com/ericchiang/pup

pup uses CSS selectors to select elements from HTML documents. Combined with curl, it gives you a simple, low-friction way to scrape data in scripts.
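
To give a feel for what that kind of selection does, here is a rough Python stand-in that uses only the standard library. It is not pup and not how pup is implemented: it understands just a `tag.class` selector, the helper name `select_attr` is made up for this sketch, and the sample HTML is invented to stand in for a page fetched with curl.

```python
from html.parser import HTMLParser


class AttrCollector(HTMLParser):
    """Collect one attribute from every element matching a `tag.class` selector."""

    def __init__(self, tag, css_class, attr):
        super().__init__()
        self.tag = tag
        self.css_class = css_class
        self.attr = attr
        self.values = []

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        # An element can carry several space-separated classes.
        classes = (attrs.get("class") or "").split()
        if tag == self.tag and self.css_class in classes and attrs.get(self.attr):
            self.values.append(attrs[self.attr])


def select_attr(html, selector, attr):
    """Hypothetical helper: `selector` must look like 'a.story'."""
    tag, css_class = selector.split(".", 1)
    collector = AttrCollector(tag, css_class, attr)
    collector.feed(html)
    collector.close()
    return collector.values


# Invented sample page standing in for what curl would fetch.
SAMPLE = """
<ul>
  <li><a class="story" href="https://example.com/a">First</a></li>
  <li><a class="story hot" href="https://example.com/b">Second</a></li>
  <li><a class="nav" href="/login">login</a></li>
</ul>
"""

if __name__ == "__main__":
    # Print the href of every <a> element that has the class "story".
    for url in select_attr(SAMPLE, "a.story", "href"):
        print(url)
```

With pup itself, the same selection should be roughly `pup 'a.story attr{href}'` on the piped-in HTML, which is a lot less typing than the above.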


