What will prevent a search engine from indexing a website?
Many search engines cannot crawl sites that use frames, or sites that are created using flash. Sites using frames should be converted to an unframed site. CSS (cascading style sheets) can be used instead of frames. Most search engines, including Google, cannot crawl pages that use “&id=” as a parameter in the URL. Websites that use dynamic URLS for user tracking, must allow robots to crawl a website without requiring session IDs or URL parameters. Search engines cannot read/follow JavaScript links. Replace JavaScript links with plain HTML links or place an additional section for HTML links on your website if the JavaScript links are necessary. Most search engines do not index IP addresses – you must submit a domain name, not an IP address. A search engine cannot index a website where the robots.txt file specifies a disallow statement for that engine.