Who is using Heritrix?
Below is a list of Heritrix users. (To qualify for inclusion in the list, send a description of a couple of lines to the mailing list.)
• The National and University Library of Iceland: Crawls the entire .is domain (~11,000 domains) using Heritrix. Has performed a complete snapshot using Heritrix 1.0.4 (35 million URIs) and plans to run three more snapshots in 2005. See 1385.
• The National Library of Finland: Has used Heritrix to crawl Finnish museum sites and sites pertaining to the June 2004 European Parliament elections. The main crawl done in 2004 was of Finnish university sites (~4 million URLs). Kaisa supplies more detail on how this larger crawl was done: 1406.
• metainfo: Geometa.info is a search engine for spatially related geo-data, geo-services and geo-news for Switzerland, Germany and Austria. We use Heritrix with specialised plugins to find geo-relevant data and websites. These include formats such as GeoTIFF, GML, Interlis and ESRI files, WFS or WMS services and othe