Honestly might be more indicative of how far behind vision is than anything. Des...

polytely · 2025-07-17T18:04:37 1752775477

Is anyone trying to solve OCR? I often think of that annas-archive blog post about how we basically just have to keep shadow libraries alive long enough for the conversion from PDF to plaintext to be solved.

https://annas-archive.org/blog/critical-window.html

I hope one of these days one of these incredibly rich LLM companies accidentally solves this or something. It would be infinitely more beneficial to mankind than the awful LLM products they are trying to make.

Metricon · 2025-07-18T02:33:46 1752806026

You may want to have a look at Mistral OCR: https://mistral.ai/news/mistral-ocr