
Even if you need, for example, word boundaries, you might be better off pretending that no non-ASCII non-word characters exist (that is, treating every non-ASCII byte as part of a word) than pretending that all of your input is valid UTF-8 or any other single encoding. It depends on your inputs and requirements.
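
A rough sketch of what that could look like in Python, working on raw bytes. The name `split_words` and the sample input are made up for illustration, and the word class is an assumption (ASCII letters, digits, underscore, plus every byte 0x80 and above), not something any particular tool does:

```python
import re

# Assumption for this sketch: ASCII letters, digits and underscore are word
# bytes, and so is every byte >= 0x80. Boundaries can then only fall on
# ASCII non-word bytes, and nothing is ever decoded.
WORD = re.compile(rb"[A-Za-z0-9_\x80-\xff]+")

def split_words(data: bytes) -> list:
    """Return the runs of word bytes in data, in order."""
    return WORD.findall(data)

# "café" encoded as UTF-8 followed by "naïve" encoded as Latin-1:
# taken as a whole, this is not valid UTF-8.
mixed = b"caf\xc3\xa9 na\xefve, ok"

print(split_words(mixed))

try:
    mixed.decode("utf-8")
except UnicodeDecodeError as exc:
    print("strict UTF-8 decoding fails:", exc.reason)
```

The byte-level split keeps going on the mixed-encoding input, while strict decoding rejects it outright. The cost is that non-ASCII punctuation and whitespace get glued onto neighbouring words, which is where the "depends on your inputs and requirements" part comes in.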


