
Why is the encoding so important and why do XML parsers get confused?


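If, for example, an XML document saved in an 8-bit "extended ASCII" encoding is sent to an OAI harvester that assumes it is receiving UTF-8, the following will happen:
• byte values from 0 to 127 (the true ASCII range) will be fine, because UTF-8 encodes them identically, but
• any byte value greater than 127 will be a problem.
For example, pi is byte 227 in some extended ASCII code pages (IBM code page 437, for instance), which in binary is 11100011. A UTF-8 parser will read this as the first byte of a three-byte character and expect two continuation bytes to follow. Completely different from what was intended! If the bytes that follow are not valid continuation bytes, the parser should reject the document as not well-formed. There's a good tutorial on XML and character encoding at http://skew.org/xml/tutorial/.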
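
The mismatch described above can be reproduced with a few lines of Python. This is only a rough sketch: it assumes the document was saved in IBM code page 437 (where byte 227 is pi), and the helper names `utf8_sequence_length` and `try_decode_utf8` are made up for illustration. It uses plain byte decoding rather than a real XML parser or OAI harvester.

```python
def utf8_sequence_length(lead_byte: int) -> int:
    """Return how many bytes a UTF-8 sequence starting with lead_byte claims.

    Returns 0 when the byte cannot start a UTF-8 sequence at all.
    """
    if lead_byte < 0x80:            # 0xxxxxxx: plain ASCII, one byte
        return 1
    if 0xC2 <= lead_byte <= 0xDF:   # 110xxxxx: two-byte sequence
        return 2
    if 0xE0 <= lead_byte <= 0xEF:   # 1110xxxx: three-byte sequence
        return 3
    if 0xF0 <= lead_byte <= 0xF4:   # 11110xxx: four-byte sequence
        return 4
    return 0                        # continuation byte or invalid lead byte


def try_decode_utf8(data: bytes):
    """Decode data as UTF-8, returning None instead of raising on failure."""
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        return None


# Hypothetical document: byte 227 (0xE3) is pi in code page 437 (an assumption).
document = b"<value>\xe3</value>"

bits = format(0xE3, "08b")                 # bit pattern of byte 227
claimed = utf8_sequence_length(0xE3)       # what a UTF-8 decoder expects next
as_utf8 = try_decode_utf8(document)        # fails: '<' is not a continuation byte
as_cp437 = document.decode("cp437")        # the encoding the author actually used
correct_utf8 = as_cp437.encode("utf-8")    # what should have been sent instead

print(bits, claimed, as_utf8, as_cp437, correct_utf8)
```

Re-encoding the text as real UTF-8 before sending it (the last step above), or declaring the actual encoding in the XML declaration, avoids the confusion.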


Experts123