Can SocSciBot 4 be used for Blog link analysis?
Yes but there are some issues. People who have studied blog links in the past have found that they are quite rare – most blogs don’t have any links to other sites other than the blogrolls. But this is not true for some genres of blogs – e.g., news filtering blogs. There is a problem with automatically extracting accurate and complete data from blogs. Blogs are quite repetitive – the same content could be on several pages in different formats and with different URLs (e.g., a page with one post; a monthly archive containing the same post and other posts, as well as the home page which may also contain the same post) – and there are different kinds of links on pages. The current recommendations are as follows: Crawl all the blogs that you are interested in, keeping all the crawls together in the same SocSciBot 4 project. Once the crawls are finished, analyse the links using the “domain” links rather than normal full URLs. In SocSciBot 4 this is in the File Type options menu. This stops mu