How does Swish-e know which parser to use?
Swish can parse HTML, XML, and text documents. The parser is set by associating a file extension with a parser by the IndexContents directive. You may set the default parser with the DefaultContents directive. If a document is not assigned a parser it will default to the HTML parser (HTML2 if built with libxml2). You may use Filters or an external program to convert documents to HTML, XML, or text.