Hacker News new | past | comments | ask | show | jobs | submit login

Given this can produce code when prompted, could it also be used to interpret html from a crawler and then be used to scrape arbitrary URLs and extract structured attributes? Basically like MarkupLM but with massively more token context?



Also curious about this. There must be a better way to scrape using LLM.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: