OCR would likely work, however I would need to render everything out with a browser and do (probably) more processing. I also wanted a way to preserve text styling exactly as the web viewer shows it, not sure if OCR supports detecting text alignment.
Dumb question: why not flip through the pages, take screenshots, then use OCR? Maybe some kind of jank Amazon DRMalike?