What's the PDF parsing like? | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		Onavo 10 months ago \| parent \| context \| favorite \| on: Show HN: Free mortgage analysis tool to avoid gett... What's the PDF parsing like?

aaln 10 months ago [–]

Extract all the text from the PDF, turn the pdf into images, send the text for each page along with the image to an LLM with a desired output strucutre.

Onavo 10 months ago | [–]

You are not doing any of the fancy table extractor stuff?

Consider applying for YC's Winter 2026 batch! Applications are open till Nov 10
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact