What are the values for the Unicode encoding schemes?
[*] If you review the String constructor with this signature String(byte[] bytes, String encoding) you can see that one argument is a value for the encoding scheme that the conversion of 8-bit bytes to 16-bit Unicode chars is to use. There are three: “Unicode”, “UnicodeBig” and “UnicodeLittle”. The first one expects the first two bytes of your data to be a Byte Order Mark, FEFF or FFFE, which specifies whether the data is in little-endian or big-endian order. If there isn’t a BOM but you already know the endianness, you can use “UnicodeBig” or “UnicodeLittle” directly. There is also a Sun document at http://java.sun.com/products/jdk/1.1/docs/guide/intl/encoding.doc.html with some related information (not much). There is another Sun document at http://java.sun.com/products/jdk/1.1/intl/html/intlspec.doc7.html which shows the table of encodings. There is a new system property called “file.encoding” which translates between codes known to Java like “Cp1252”, and locale encoding names like