
How does Collexis deal with low concept density documents or queries?


One standard option is to index the document without a thesaurus. This process includes most of the usual indexing steps (stop-word removal, normalization, etc.) but generates a fingerprint with word-based entries instead of concept entries. Because Collexis can work with multiple thesauri simultaneously, such a “free text” fingerprint can be used alongside a thesaurus-based fingerprint and can capture terms that are not present in the thesaurus. The word-based entries can consist of any number of consecutive words (single words, bigrams, trigrams, etc.). Naturally, a free text fingerprint does not offer the advantages of a thesaurus-based fingerprint, such as multilingualism and synonym handling.
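To make the idea of a word-based fingerprint more concrete, here is a minimal Python sketch. It is not Collexis code and does not use any Collexis API. The function names, the tiny stop-word list, the lowercase normalization, and the choice to use raw counts as weights are all assumptions made for illustration. It also assumes that an n-gram should not span a removed stop word, so that entries only cover words that were truly consecutive in the text.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny stop-word list (an assumption for this sketch).
STOP_WORDS = frozenset(
    {"a", "an", "and", "the", "of", "in", "on", "for", "to", "with", "is", "are"}
)


def normalize(text):
    """Lowercase the text and split it into alphanumeric tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def free_text_fingerprint(text, max_n=3):
    """Count word n-grams (n = 1..max_n) over runs of consecutive non-stop words.

    Returns a Counter mapping each n-gram string to its frequency.
    Raw frequency stands in for whatever weighting a real system would use.
    """
    # Split the token stream into runs, breaking at every stop word,
    # so that no n-gram bridges a word that was removed.
    runs, current = [], []
    for token in normalize(text):
        if token in STOP_WORDS:
            if current:
                runs.append(current)
            current = []
        else:
            current.append(token)
    if current:
        runs.append(current)

    counts = Counter()
    for run in runs:
        for n in range(1, max_n + 1):
            for i in range(len(run) - n + 1):
                counts[" ".join(run[i:i + n])] += 1
    return counts


if __name__ == "__main__":
    fp = free_text_fingerprint(
        "Gene expression in the liver and gene expression profiling"
    )
    print(fp["gene expression"])  # 2
```

In this sketch, a low concept density query would still yield entries such as "gene expression profiling" even if no thesaurus contained that phrase. What the sketch cannot do is recognize synonyms or translations of the same concept, which is the limitation the answer above describes.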


Experts123