Isn't the canonical ordering for Arabic characters wrong?
The Unicode Standard does not guarantee that the canonical ordering of a combining character sequence for any particular script is the 'correct' order from a linguistic point of view. The guarantee is only that any two canonically equivalent strings will have the same canonical order. In retrospect, it would have been possible to assign combining classes to certain Arabic and Hebrew non-spacing marks (plus characters for a few other scripts) in a way that made a canonically ordered sequence better reflect linguistic order or traditional spelling order. However, changing those combining classes now would conflict with the stability guarantees the Unicode Standard made when normalization was specified, so it cannot be done.
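As an illustration of the point above, here is a minimal sketch using Python's standard `unicodedata` module. The choice of BEH with SHADDA and FATHA is our own example, not one taken from the FAQ, and the variable and helper names are hypothetical. It shows that canonical reordering places FATHA (combining class 30) before SHADDA (combining class 33), even though SHADDA is commonly written first, and that both input orders normalize to the same string.

```python
import unicodedata

# Example characters chosen for illustration (not from the FAQ itself).
BEH = "\u0628"     # ARABIC LETTER BEH, combining class 0
FATHA = "\u064E"   # ARABIC FATHA, combining class 30
SHADDA = "\u0651"  # ARABIC SHADDA, combining class 33

def combining_classes(text):
    """Return the canonical combining class of each code point in text."""
    return [unicodedata.combining(ch) for ch in text]

# Two canonically equivalent spellings of the same sequence.
shadda_first = BEH + SHADDA + FATHA
fatha_first = BEH + FATHA + SHADDA

# Normalization sorts adjacent non-zero combining classes in ascending order.
nfd_shadda_first = unicodedata.normalize("NFD", shadda_first)
nfd_fatha_first = unicodedata.normalize("NFD", fatha_first)

print(combining_classes(shadda_first))      # classes before normalization
print(combining_classes(nfd_shadda_first))  # classes after normalization
print(nfd_shadda_first == nfd_fatha_first)  # both spellings converge
```

The comparison on the last line is the property the standard actually guarantees: equivalent inputs end up identical after normalization, whatever the resulting mark order happens to be.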