There appear to be garbage characters in the HTML. Whats up?
What you’re seeing is most probably not garbage, but the result of a mismatch between the actual encoding of the HTML output and the encoding your browser is expecting. Your browser is misinterpreting the HTML data, which is encoded text. A discussion of text encodings is beyond the scope of this FAQ; see one or more of these documents for more info: • UTF-8 and Unicode FAQ for Unix/Linux • Chapters 3 and 4 of Introduction to i18n [Internationalization] • Python Unicode Tutorial • Python Unicode Objects: Some Observations on Working With Non-ASCII Character Sets The common case is with the default output encoding (UTF-8), when either numbered sections are used (via the “sectnum” directive) or symbol-footnotes. 3 non-breaking spaces are inserted in each numbered section title, between the generated number and the title text. Most footnote symbols are not available in ASCII, nor are non-breaking spaces. When encoded with UTF-8 and viewed with ordinary ASCII tools, these characters will a
What you’re seeing is most probably not garbage, but the result of a mismatch between the actual encoding of the HTML output and the encoding your browser is expecting. Your browser is misinterpreting the HTML data, which is encoded text. A discussion of text encodings is beyond the scope of this FAQ; see one or more of these documents for more info: • UTF-8 and Unicode FAQ for Unix/Linux • Chapters 3 and 4 of Introduction to i18n [Internationalization] • Python Unicode Tutorial • Python Unicode Objects: Some Observations on Working With Non-ASCII Character Sets The common case is with the default output encoding (UTF-8), when either numbered sections are used (via the “sectnum” directive) or symbol-footnotes. 3 non-breaking spaces are inserted in each numbered section title, between the generated number and the title text. Most footnote symbols are not available in ASCII, nor are non-breaking spaces.
Related Questions
- Characters that are "special" are forbidden by EDGAR Filer Manual v11 6.10.6 but they appear in the Original HTML/ASCII, so how do I get them into labels?
- Why do some characters like asterisks or dots appear on the screen after turning the TV on?
- There appear to be garbage characters in the HTML. Whats up?