How can I get htdig to index Chinese, Japanese or Korean text?
You can’t do that yet. Current versions of ht://Dig only support 8-bit characters, so languages such as Chinese, Japanese and Korean, which require 16-bit characters, are not currently supported. The same goes for documents in any language if the document is encoded in anything but simple 8-bit character sets. Unicode and UTF-8 documents are not supported. There are long-range plans to add support for these, but it’s a huge task that no developer has taken up yet.However, you can index documents using any 8-bit only encoding that is supported by a locale on the indexing system, by following the guidelines in questions 4.10 and 5.8.