Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I tried https://github.com/PaddlePaddle/PaddleOCR for my own use case (scanline images of parcel labels) and it beat Tesseract by an order of magnitude.

(Tesseract managed to get 3 fields out of a damaged label, while PaddleOCR found 35, some of them barely readable even for a human taking time to decypher them)



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: