How to write spam filter

Eric picture Eric · Nov 17, 2008 · Viewed 8.3k times · Source

I'm stuck in having to write a simple spam filter I'm not really sure about how I'm going to do it.

So far I've come up with wordlist and domain filtering, which will give or remove points up to a certain threshold.

For example, if you're writing about "v1agr4" from a blacklisted domain, you'll get like 2 points for spam, but if you're writing about "v1agr4" from a hotmail.com account, you'll get only 1 "spam point".

Do you guys have any other suggestions / ressources?

This is more about learning spam filters than developing something enterprise grade

Answer

BoltBait picture BoltBait · Nov 17, 2008

Some really good algorithm info here:

http://www.paulgraham.com/spam.html

http://www.paulgraham.com/better.html

But, seriously, why reinvent the wheel?

Just download K9: http://keir.net/k9.html