Author: Peter Bowyer Date: To: Exim, Users New-Topics: [exim] Exim based FuzzyOCR Subject: Re: [exim] SPAM Filtering - Losing the war!
On 25/10/06, Marc Perkel <marc@???> wrote: >
>
> Peter Bowyer wrote:
> On 25/10/06, Marc Perkel <marc@???> wrote: > Wakko Warner wrote: >
>
> Have you tried gocr? I have, it's not the greatest, but it is possible
> that it can read the image and allow you to react on the content of the > image. It's not that great yet, but I think it could have potential (until > spammers start using the other characters in the images like they are in > text spam)
> I have started to install it several times and got distracted. I think
> I will try it.
> We've just implemented the FuzzyOCR plugin for SA, which uses gocr under the > covers. It's catching 100% of the current wave of image spam with no FPs > reported yet. Very impressed.
Since the OCR is farly expensive I might look > at pulling out the SA plugin bits and implementing the guts as a perl module > to call directly from Exim (before SA), where I can be a bit more choosy > about which attachments I choose to process.
Peter
>
> That would be great if we can create an Exim equiv of the Fuzzy OCR code.
> I'd rather drop it in Exim so that SA never sees it and therefore never gets
> to poison my bayes filters.
>
Agreed - the example posted earlier is by its own admission not
'fuzzy' - I'll do some testing on it and see how it performs. Maybe it
needs a bit of the fuzziness from FuzzyOCR.