I'm stuck in having to write a simple spam filter I'm not really sure about how I'm going to do it.
So far I've come up with wordlist and domain filtering, which will give or remove points up to a certain threshold.
For example, if you're writing about "v1agr4" from a blacklisted domain, you'll get like 2 points for spam, but if you're writing about "v1agr4" from a hotmail.com account, you'll get only 1 "spam point".
Do you guys have any other suggestions / ressources?
This is more about learning spam filters than developing something enterprise grade
Some really good algorithm info here:
http://www.paulgraham.com/spam.html
http://www.paulgraham.com/better.html
But, seriously, why reinvent the wheel?
Just download K9: http://keir.net/k9.html