The WWW7 paper mentions a “CategoryClassifier”, but I can find it in the source code. Where can I get it?
The CategoryClassifier was part of an earlier web-crawling system, SPHINX, developed at Compaq SRC. The original SPHINX code belongs to Compaq SRC and was never released. WebSPHINX is an open-source reimplementation of the SPHINX interface. CategoryClassifier was not part of this reimplementation because CategoryClassifier depended on some other software that belongs to SRC.