What is the difference between W3C normalization and Unicode normalization?
Unicode normalization comes in four forms: C, D, KC, and KD (also written NFC, NFD, NFKC, and NFKD). Form C is the one relevant to W3C normalization. In addition, W3C normalization treats character references (&#nnnn;) as equivalent to the characters they represent. For example, the text string “a&#x0301;” is Unicode-normalized, since it consists only of ASCII characters, but it is not W3C-normalized: the character reference represents a combining acute accent following “a”, and in normalization form C that sequence should have been composed into the single character U+00E1.
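The example above can be checked with a short Python sketch. It uses the standard library's `unicodedata.normalize` for form C and `html.unescape` to expand character references. The function name `is_w3c_normalized_sketch` is hypothetical, and the check is a simplification that covers only the character-reference aspect described here, not every condition in the W3C definition. Note also that `html.unescape` expands named HTML entities as well, which is an assumption made for convenience.

```python
import html
import unicodedata


def is_nfc(text: str) -> bool:
    """Return True if the string is already in Unicode normalization form C."""
    return unicodedata.normalize("NFC", text) == text


def is_w3c_normalized_sketch(text: str) -> bool:
    """Hypothetical, simplified check: the text must be in form C both
    as written and after its character references are expanded."""
    expanded = html.unescape(text)  # "&#x0301;" becomes U+0301
    return is_nfc(text) and is_nfc(expanded)


raw = "a&#x0301;"

# Only ASCII characters, so the raw string is already in form C.
print(is_nfc(raw))

# Expanding the reference yields "a" + U+0301, which is not in form C.
print(is_w3c_normalized_sketch(raw))

# Form C composes "a" + U+0301 into the single character U+00E1.
print(unicodedata.normalize("NFC", html.unescape(raw)) == "\u00e1")
```

Writing the precomposed character U+00E1 directly, or as the reference `&#x00E1;`, would pass both checks in this sketch.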
Related Questions
- What is the difference between the Unicode character properties "Ideographic" and "Unified_Ideograph"?
- What's the difference between normalization and dynamic range compression?
- What is the difference between Unicode fonts and other fonts?