
What's the PDF parsing like?


Extract all the text from the PDF, render each page as an image, then send each page's text along with its image to an LLM, together with the desired output structure.
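Roughly, the flow above could look like the sketch below. This is only an illustration under assumptions, not the actual implementation: `Page`, `build_request`, `parse_pdf`, and the request field names are all hypothetical, the text extraction and page rendering are assumed to have happened already with whatever PDF library you prefer, and `call_llm` is a stand-in for whichever model client you use.

```python
import base64
import json
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Page:
    """One PDF page after extraction (hypothetical container)."""
    number: int        # 1-based page number
    text: str          # text pulled from the page's text layer
    image_png: bytes   # the same page rendered as a PNG


def build_request(page: Page, schema: Dict) -> Dict:
    """Bundle one page's text, its image, and the desired output structure."""
    return {
        "page": page.number,
        # The field names here are assumptions, not a real provider's API.
        "instructions": "Return JSON matching this schema: "
        + json.dumps(schema, sort_keys=True),
        "text": page.text,
        "image_base64": base64.b64encode(page.image_png).decode("ascii"),
    }


def parse_pdf(
    pages: List[Page],
    schema: Dict,
    call_llm: Callable[[Dict], str],
) -> List[Dict]:
    """Send each page to the model and decode its JSON reply."""
    return [json.loads(call_llm(build_request(p, schema))) for p in pages]
```

Passing the model call in as a function keeps the sketch independent of any particular provider, and lets you swap in a stub when testing. Sending both the text and the image means the model can lean on the exact characters from the text layer while using the image for layout.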


You are not doing any of the fancy table extractor stuff?



