[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Author: Pavel Kostromitinov
Date:  
To: pcre-dev
Old-Topics: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Subject: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #18 from Pavel Kostromitinov <pavel@???> 2010-03-22 13:12:38 ---
(In reply to comment #17)
> > And there are problems with pcre_study.c.
> What problems?

Frankly, I no longer remember exactly, but I had to specify
PCRE_NO_START_OPTIMIZE for most of the patterns to work.


> > Also - there should be some hack for \t to be treated as \s (it is NOT \pZ)
>
> Do you mean that you do want it to match \pZ? Or do you want \t to still
> match \s when \w, \d, etc. are using Unicode properties?


No, it should not match \pZ; that would be against the Unicode standard. But
then the \s check should not be made equivalent to \pZ. It should rather be
something like "\pZ or isspace()", so that \t still matches \s.
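
To make the idea concrete, here is a rough sketch of such a check. It is only
an illustration, not PCRE code: the function names is_pZ() and
is_backslash_s() are made up, and the \pZ test is a hand-written list of the
Zs/Zl/Zp code points rather than a lookup in PCRE's property tables. isspace()
is consulted only for ASCII values, so the result does not depend on the
locale for anything above 0x7F.

```c
#include <ctype.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: hand-written approximation of \pZ
 * (general categories Zs, Zl and Zp). A real implementation
 * would consult the Unicode property tables instead. */
bool is_pZ(uint32_t cp)
{
    return cp == 0x0020 || cp == 0x00A0 || cp == 0x1680 ||
           (cp >= 0x2000 && cp <= 0x200A) ||   /* EN QUAD .. HAIR SPACE */
           cp == 0x2028 || cp == 0x2029 ||     /* line / paragraph separator */
           cp == 0x202F || cp == 0x205F || cp == 0x3000;
}

/* Hypothetical \s test: "\pZ or isspace()".
 * \t, \n, \r etc. are control characters (Cc), not \pZ,
 * so they are picked up by the isspace() branch only. */
bool is_backslash_s(uint32_t cp)
{
    if (is_pZ(cp))
        return true;
    /* Restrict isspace() to ASCII to keep the argument in range. */
    return cp < 0x80 && isspace((int)cp) != 0;
}
```

With this, is_backslash_s('\t') is true although is_pZ('\t') is false, while
U+00A0 and U+2028 match through the \pZ branch.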


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email