Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

That is amazing!! I will check that out whenever it's done. Thank you for your work!

Dumb question: why not flip through the pages, take screenshots, then use OCR? Maybe some kind of jank Amazon DRMalike?



OCR would likely work, however I would need to render everything out with a browser and do (probably) more processing. I also wanted a way to preserve text styling exactly as the web viewer shows it, not sure if OCR supports detecting text alignment.

Also I've released the tool and a post about it here: https://news.ycombinator.com/item?id=45610226


Just saw it at #1! Congrats and thank you!




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: