Is there a high-level summary that shows how SpamBayes works?

There are eight main components to the SpamBayes system:

• A database. Loosely speaking, this is a collection of words and associated spam and ham probabilities. The database says “If a message contains the word ‘Viagra’ then there’s a 98% chance that it’s spam, and a 2% chance that it’s ham.” This database is created by training – you give it messages, tell it whether those messages are ham or spam, and it adjusts its probabilities accordingly. How to train it is covered below. By default it lives in a file called “hammie.db”, or (for the Outlook plugin) “default_bayes_database”.

• The tokenizer/classifier. This is the core engine of the system. The tokenizer splits emails into tokens (words, roughly speaking), and the classifier looks at those tokens to determine whether the message looks like spam or not. You don’t use the tokenizer/classifier directly – it powers the other parts of the system.

• The POP3 proxy. This sits between your email client (Eudora, Outlook Express, etc.) and
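
To make the database and tokenizer/classifier ideas above concrete, here is a minimal Python sketch. It is not SpamBayes's actual code and does not use its API: the names `tokenize`, `ToyDatabase`, and `classify` are hypothetical, the database is kept in memory rather than in a file like “hammie.db”, and the scores are combined with a simple naive-Bayes-style log-odds sum, which is an assumption made for illustration rather than the combining scheme SpamBayes itself uses.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Split a message into a set of lowercase word tokens (a rough stand-in for a real tokenizer)."""
    return set(re.findall(r"[a-z0-9']+", text.lower()))


class ToyDatabase:
    """Hypothetical in-memory store of per-token spam and ham message counts."""

    def __init__(self):
        self.spam_counts = Counter()
        self.ham_counts = Counter()
        self.n_spam = 0
        self.n_ham = 0

    def train(self, text, is_spam):
        """Record one message that the user has labelled as spam or ham."""
        tokens = tokenize(text)
        if is_spam:
            self.spam_counts.update(tokens)
            self.n_spam += 1
        else:
            self.ham_counts.update(tokens)
            self.n_ham += 1

    def spam_probability(self, token):
        """Estimate the chance that a message containing `token` is spam."""
        # Add-one smoothing keeps the estimate strictly between 0 and 1,
        # so unseen tokens come out neutral instead of breaking the maths.
        spam_freq = (self.spam_counts[token] + 1) / (self.n_spam + 2)
        ham_freq = (self.ham_counts[token] + 1) / (self.n_ham + 2)
        return spam_freq / (spam_freq + ham_freq)


def classify(db, text):
    """Combine per-token probabilities into one spam score between 0 and 1."""
    log_odds = 0.0
    for token in tokenize(text):
        p = db.spam_probability(token)
        log_odds += math.log(p) - math.log(1 - p)
    # Clamp so that math.exp cannot overflow on very long messages.
    log_odds = max(-700.0, min(700.0, log_odds))
    return 1 / (1 + math.exp(-log_odds))


db = ToyDatabase()
db.train("cheap viagra buy now", is_spam=True)
db.train("viagra discount offer", is_spam=True)
db.train("meeting agenda for monday", is_spam=False)
db.train("lunch on monday?", is_spam=False)

spam_score = classify(db, "buy viagra")
ham_score = classify(db, "monday meeting")
```

With this toy training set, `spam_score` comes out above 0.5 and `ham_score` below it. Training only ever adjusts the stored counts, which matches the description above: the classifier never changes, only the probabilities it looks up.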
