Hacker News new | past | comments | ask | show | jobs | submit login

> Currently we are looking into developing an OCR API. Would that be useful for you?

Oh yes! Absolutely! However, I only would use it for private purposes, I do not plan to write an application for it, that I'd release. I would do a lot of preprocessing (mostly cutting into boxes, which I would then save in separate bitmaps) and then upload that.

Just these days I am in the process of converting the scans of a recipe-book, that is not available electronically, but which I want to have on my smartphone. There is a PDF online, but that is only a slideshow of bitmap images. It also has ornamental frames on each page, ornaments in text, etc. And as I did not find the results of open source OCR apps to be satisfactory, I started roaming the commercial sector (I still have to evaluate ABBYY Finereader, but that would be very expensive).

Ideally, so I thought, a household would have an account, that is realized via a basic subscription plus pay by volume, and then available as a REST API. The subscription provider would offer a simple user client like https://www.roxyappsdev.com/applications/windows-10-applicat... and a mobile client (think "document scan", with batch and auto deskew/process) but otherwise work (hard ;-)) on the recognition. Now, if hand-writing would also be possible, that would be great!

BTW: What is 'OSD'? And why not some graphics, that explain the different page-selection/-recognition modes?




Join us for AI Startup School this June 16-17 in San Francisco!

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: