How can I force the fetcher to use a custom nutch-config?
• Create a new sub-directory under $NUTCH_HOME/conf, for example conf/myconfig.
• Copy these files from $NUTCH_HOME/conf to the new directory: common-terms.utf8, mime-types.*, nutch-conf.xsl, nutch-default.xml, regex-normalize.xml, regex-urlfilter.txt.
• Modify the copied nutch-default.xml to suit your needs.
• Set the NUTCH_CONF_DIR environment variable to point to the directory you created.
• Run $NUTCH_HOME/bin/nutch so that it picks up the NUTCH_CONF_DIR environment variable. Check the command output for the lines where the configs are loaded, to confirm that they really are loaded from your custom directory.
• Happy using.
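The steps above could be scripted roughly as follows. This is only a minimal sketch, assuming a POSIX shell and that NUTCH_HOME is already set; the function name make_nutch_conf and the directory name myconfig are illustrative and not part of Nutch itself.

```shell
#!/bin/sh
# Sketch: create a custom config directory under <nutch_home>/conf and
# copy the files listed above into it.
# make_nutch_conf is a hypothetical helper, not something shipped with Nutch.

make_nutch_conf() {
    nutch_home="$1"          # e.g. the value of $NUTCH_HOME
    name="$2"                # e.g. myconfig
    src="$nutch_home/conf"
    dest="$src/$name"

    mkdir -p "$dest" || return 1

    # mime-types.* is a glob; an unmatched glob or a missing file is skipped.
    for f in "$src"/common-terms.utf8 "$src"/mime-types.* \
             "$src"/nutch-conf.xsl "$src"/nutch-default.xml \
             "$src"/regex-normalize.xml "$src"/regex-urlfilter.txt
    do
        if [ -f "$f" ]; then
            cp "$f" "$dest/" || return 1
        fi
    done

    # Print the new directory so the caller can export it.
    printf '%s\n' "$dest"
}

# Possible usage (edit the copied nutch-default.xml before running anything real):
#   NUTCH_CONF_DIR=$(make_nutch_conf "$NUTCH_HOME" myconfig)
#   export NUTCH_CONF_DIR
#   "$NUTCH_HOME/bin/nutch"
```

Exporting NUTCH_CONF_DIR in the same shell that launches bin/nutch matters here, because the variable is only visible to processes started from that shell.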