
Yes, combining characters are what I was referring to with "higher level Unicode shenanigans". It doesn't stop there, though: many people would say that "а" and "a" are encodings of the same character even if Unicode thinks otherwise. All of that is above the concerns of UTF-8, which only cares about encoding code points into byte sequences.
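To make that concrete, here is a rough Python sketch. It assumes the two letters above are Cyrillic U+0430 and Latin U+0061, and it uses "é" as an arbitrary example of a combining sequence (my choice, not something from the thread). UTF-8 just maps each code point to bytes; whether two sequences "mean" the same character is decided a layer up, e.g. by normalization.

```python
import unicodedata

# Assumption: the look-alike pair is Latin U+0061 and Cyrillic U+0430.
latin_a = "\u0061"     # LATIN SMALL LETTER A
cyrillic_a = "\u0430"  # CYRILLIC SMALL LETTER A

# UTF-8 only sees two different code points, so it emits different bytes.
latin_bytes = latin_a.encode("utf-8")
cyrillic_bytes = cyrillic_a.encode("utf-8")
print(latin_bytes, cyrillic_bytes)

# Illustrative combining-character case: precomposed vs. decomposed "é".
composed = "\u00e9"     # LATIN SMALL LETTER E WITH ACUTE
decomposed = "e\u0301"  # "e" followed by COMBINING ACUTE ACCENT

# Again, different code point sequences give different byte sequences.
composed_bytes = composed.encode("utf-8")
decomposed_bytes = decomposed.encode("utf-8")
print(composed_bytes, decomposed_bytes)

# Normalization (above UTF-8) treats the two "é" forms as equivalent...
same_after_nfc = unicodedata.normalize("NFC", decomposed) == composed

# ...but does not unify the Cyrillic and Latin letters, even with NFKC.
homoglyphs_unified = unicodedata.normalize("NFKC", cyrillic_a) == latin_a
print(same_after_nfc, homoglyphs_unified)
```

So the encoder is doing exactly its job in both cases; catching the look-alike pair would need something like a confusables check, which is a separate layer again.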

