Re: [Exim] Regular Expression - Spam Filtering

Pàgina inicial
Delete this message
Reply to this message
Autor: Dave C.
Data:  
A: Miquel van Smoorenburg
CC: exim-users
Assumpte: Re: [Exim] Regular Expression - Spam Filtering
On Wed, 28 Nov 2001, Miquel van Smoorenburg wrote:

> In article <3C053234.6CC18BE5@???>,
> Marc Perkel <marc@???> wrote:
> >I want to catch spam based on wirds inside of url's within a message.
> >For example, if the url contains "fetish" it's a spammer. But I only
> >want to test the domain name and not the web page.
> >
> >So
> >
> >http://www.fetish.com/test.htm
> >
> >is a match - but -
> >
> >http://www.test.com/fetish/test
> >
> >is NOT a match
> >
> >The idea is to test the text IN BETWEEN the http and the first / or
> >space.
> >
> >How do I write a regular expression to do this?
>
> You really should read up on regular expressions, because this
> is a regexp 101 question. Anyway:


Do you know where one can obtain a good copy of "Regexp 101"?? I'm not
a complete novice, but I would also like a good regexp tutorial to pick
up on all the things I'm sure I'm not aware of.

Something that was cognizant of the minor syntax differences between
perl, grep, and other regexp implementations would be nice, too..

>
> http://[^/]*(word_here|anotherword|morestuff)[^/]*/
>
> Mike.
>


--