[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Góra strony
Delete this message
Autor: Pavel Kostromitinov
Data:  
Dla: pcre-dev
Stare tematy: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Temat: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #18 from Pavel Kostromitinov <pavel@???> 2010-03-22 13:12:38 ---
(In reply to comment #17)
> > And there are problem with pcre_study.c.
> What problems?

Frankly speaking, I don't remember exactly already, but I had to specify
PCRE_NO_START_OPTIMIZE for most of the patterns to work.


> > Also - there should be some hack for \t to be treated as \s (it is NOT \pZ)
>
> Do you mean that you do want it to match \pZ? Or you do want \t to still
> match \s when \w, \d, etc are using Unicode properties?


No, it should not match \pZ, that would be against Unicode standard. But then
\s check should be made not equivalent to \pZ, but rather something like "\pZ
or isspace()"


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email