Re: [Exim] 8-bit in From: and Subject:

Top Page
Delete this message
Reply to this message
Author: Steven A. Reisman
Date:  
To: exim-users
Subject: Re: [Exim] 8-bit in From: and Subject:
On Thu, Nov 21, 2002 at 10:30:09PM +0100, Arnulv Rudland wrote:
> > >From my #Exim filter:
> >
> >    if $h_subject matches [\\x80-\\xff]{5}  ...

>
> why do you need to get rid of them? I receive a lot of mails with encoded
> subjects:


Fair question...

I've got a dozen or more email addresses that funnel into my mailbox.
Consequently, I get a tremendous amount of spam.

My filter first sorts mailing list email into separate mailboxes.
Then it agressively looks for anything resembling spam and throws that
into a spam mailbox which I periodically review. The test above looks
for 5 non-ASCII characters in a row which catches a lot of Asian spam,
such as:

   Subject: <<B1><A4><B0><ED>> <C8><AB><BA><B8><B0><A1>
      <C7><CA><BF><E4><C7><CF><BD><C3><C1><D2>?! <C0><CC><C1><A6>
      <C0><CC><B8><DE><C0><CF><B8><B6><C4><C9><C6><C3><C0><B8><B7><CE>
      <BD><C3><C0><DB><C7><CF><BD><CA><BD><C3><BF><E4>!!!


(I hope that subject line wasn't too racy.)


> here an example:
>
> "Subject: =?iso-8859-1?Q?Olympia_2012_/_MV_M=E4rz_der_WJ_L=FCbeck?="
>
> which in the receiving MUA translates to:
>
> "Subject: Olympia 2012 / MV M&auml;rz der WJ L&uuml;beck"
>
> (%auml; and &uuml; of course displayed as the correct umlaut)
>
> wouldn't it be better to have a filter doing the straight-foward encoding?
> in worst case a perl one-liner.


Without 5 consecutive non-ASCII characters, it would not be caught by
my test.


Steve


----------------------------------------------------------------------
Steven A. Reisman          <sar@???>           P.O. Box 409
PressEnter LLP                                        421 N 2nd Street
715-426-2100 or 651-436-5254                    River Falls, WI  54022
----------------------------------------------------------------------