Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
witty_username
on Dec 28, 2016
|
parent
|
context
|
favorite
| on:
Library-managed 'arXiv' spreads scientific advance...
Well, less can extract the text; so I don't see why that's an issue.
vsl
on Dec 28, 2016
|
next
[–]
It’s only an issue with crap PDFs (it’s possible to omit text information or obfuscate it to the point you copy & paste garbage out of them; pdfTeX-created PDFs are of course fine).
CJefferson
on Dec 28, 2016
|
prev
[–]
It doesn't work often for multi column pdfs or tables, ligatures are usually misparsed, and maths is just destroyed.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: