Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

Why aren there extra characters to support locale-independent casing for Turkish?

April 26, 2017arent casing characters extra independent locale support Turkish

0

10 Posted

Why aren there extra characters to support locale-independent casing for Turkish?

1 Answer

0

Posted

The fact is that there is too much data coded in 8859-9 (with 0xDD = LATIN CAPITAL LETTER I WITH DOT and 0xFD = LATIN SMALL LETTER DOTLESS I) which contains both Turkish and non-Turkish text. Transcoding this data to Unicode would be intolerably difficult if it all had to be tagged first to sort out which 0x49 characters are ordinary “I” and which are CAPITAL LETTER DOTLESS I. Better to accept the compromise and get on with moving to Unicode. Moreover, there is a strong doubt that users will “get it right” in future either when they enter new characters.