[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Inizio della pagina
Delete this message
Autore: Pavel Kostromitinov
Data:  
To: pcre-dev
Vecchi argomenti: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Oggetto: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #18 from Pavel Kostromitinov <pavel@???> 2010-03-22 13:12:38 ---
(In reply to comment #17)
> > And there are problem with pcre_study.c.
> What problems?

Frankly speaking, I don't remember exactly already, but I had to specify
PCRE_NO_START_OPTIMIZE for most of the patterns to work.


> > Also - there should be some hack for \t to be treated as \s (it is NOT \pZ)
>
> Do you mean that you do want it to match \pZ? Or you do want \t to still
> match \s when \w, \d, etc are using Unicode properties?


No, it should not match \pZ, that would be against Unicode standard. But then
\s check should be made not equivalent to \pZ, but rather something like "\pZ
or isspace()"


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email