Important Notice: Our web hosting provider recently started charging us for additional visits, which was unexpected. In response, we're seeking donations. Depending on the situation, we may explore different monetization options for our Community and Expert Contributors. It's crucial to provide more returns for their expertise and offer more Expert Validated Answers or AI Validated Answers. Learn more about our hosting issue here.

How Does Unicode Handle South Asian Languages?

April 26, 2017Asian Languages south Unicode

0

Posted

How Does Unicode Handle South Asian Languages?

1 Answer

0

Posted

As a standard, Unicode is more concerned with characters and scripts than with languages. The characters used by different South Asian languages are therefore organized by script. Most often, characters in the same script have numbers that are next to each other in a character block . Currently (in Unicode 5.1) there are twelve “Indic script” character blocks in Unicode. They are: Bengali, Devanagari, Gujarati, Gurmukhi, Kannada, Limbu, Malayalam, Oriya, Sinhala, Syloti Nagri, Tamil, and Telugu. The characters used by right-to-left languages like Urdu, Pashto, and Sindhi are found on the Arabic character code chart. Tibetan has its own code chart.