I mean, I'd totally try Tesseract[1], a few samples, and a python script. Should...

mrazomor · on June 2, 2024

Tesseract out of the box is terrible for anything non standard. I tried using it for the comic books. Unusable. The training for your font is doable, but it's very time intensive (while the tools are pretty good!).

motoxpro · on June 2, 2024

I'd say any of the language models are far better than Tesseract. I did some work in this space and it was an absolute nightmare, event working with pdfs.

driscoll42 · on June 2, 2024

For OCR of handwriting, I did some comparative analysis a year back, and I found that Tesseract was... not good. However TrOCR was okay, certainly the best of the FOSS solutions. But Textract from Amazon was the best one by far far for handwriting, though your mileage will vary

123yawaworht456 · on June 2, 2024

from my experience with tesseract ~1 year ago, it was frequently fucking up even with crispy PNG screenshots

I really doubt it can handle handwriting

radiantspace · on June 2, 2024

Handwritten notes, cmon! Don't waste time on tesseract for that.