What do the gatherer options “Search=Breadth” and “Search=Depth” do, and which keywords are available for the “Search=” option?
The Search option selects the enumerator used for http and gopher URLs. Harvest comes with two enumerators for http and gopher: breadth first (Search=Breadth) and depth first (Search=Depth). They differ in the order in which they follow links to build the list of candidate URLs for processing. The breadth first enumerator processes all links on one level before descending to the next level. If you limit the number of URLs gathered from a site, it therefore gives you a more representative overview of that site. The depth first enumerator descends to the next level as soon as possible; when no links are left in the current branch, it moves on to the next branch. The depth first enumerator uses less memory than the breadth first enumerator. Unless you have a compelling reason to switch from one enumerator to the other, the default value should be a reasonable choice.
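The difference in visiting order can be illustrated with a small sketch. This is not Harvest's code: the `LINKS` table, the `enumerate_urls` function, and the `limit` parameter are hypothetical names made up for this example, and the "site" is a hard-coded toy link graph rather than real http or gopher requests.

```python
from collections import deque

# Hypothetical toy site: each page maps to the links found on it.
LINKS = {
    "/": ["/a", "/b"],
    "/a": ["/a/1", "/a/2"],
    "/b": ["/b/1"],
    "/a/1": [],
    "/a/2": [],
    "/b/1": [],
}

def enumerate_urls(root, strategy, limit):
    """Return up to `limit` URLs in breadth-first or depth-first order."""
    pending = deque([root])
    seen = {root}
    result = []
    while pending and len(result) < limit:
        if strategy == "breadth":
            url = pending.popleft()  # queue: finish the current level first
        else:
            url = pending.pop()      # stack: follow the newest branch first
        result.append(url)
        links = LINKS.get(url, [])
        if strategy == "depth":
            # Push in reverse so the first link on the page is followed first.
            links = list(reversed(links))
        for link in links:
            if link not in seen:
                seen.add(link)
                pending.append(link)
    return result

print(enumerate_urls("/", "breadth", 4))  # ['/', '/a', '/b', '/a/1']
print(enumerate_urls("/", "depth", 4))    # ['/', '/a', '/a/1', '/a/2']
```

With a limit of four URLs, the breadth first order touches both top-level branches, while the depth first order spends the whole budget inside `/a` and never reaches `/b`. That is the sense in which breadth first gives a more representative overview of a site when the number of URLs is capped.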