On Wed, Nov 29, 2006, David Saez Padros wrote:
>> Any comments or criticism?
> one good way will be to also serach for common html patterns (for
> image spam)
Requires a lot of hand-picking, unlike text/plain matching, which
involves no manual operations at all.
Here's what I got:
The SA ruleset <
http://tehran.lain.pl/stuff/sa/w_popular_html.cf>
Masscheck results <
http://tehran.lain.pl/stuff/sa/w_popular_html.txt>
I don't have much text/html ham, so these would probably generate lots
of false positives. Testing would be appreciated, though!