Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there anyone trying to solve OCR, I often think of that annas-archive blog about how we basically just have to keep shadow libraries alive long enough until the conversion from pdf to plaintext is solved.

https://annas-archive.org/blog/critical-window.html

I hope one of these days one of these incredibly rich LLM companies accidentally solves this or something, would be infinitely more beneficial to mankind than the awful LLM products they are trying to make



You may want to have a look at Mistral OCR: https://mistral.ai/news/mistral-ocr




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: