Hacker News new | past | comments | ask | show | jobs | submit login

The original version of Unicode was primarily intended to unify all existing character sets as opposed to designing a character database from fundamental writing script principles. That's why most of the Latin accented characters (e.g., à) come in precomposed form.

It is worth noting that precomposed Hangul syllables decompose to the Jamo characters under NFD (and vice versa for NFC). However, most data is sent and used with NFC normalization.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: