Re: [pcre-dev] [Bug 897] \w and others based on Unicode prop…

Top Page
Delete this message
Author: Philip Hazel
Date:  
To: 897
CC: pcre-dev
Subject: Re: [pcre-dev] [Bug 897] \w and others based on Unicode properties
On Mon, 22 Mar 2010, Pavel Kostromitinov wrote:

> > > And there are problem with pcre_study.c.
> > What problems?
> Frankly speaking, I don't remember exactly already, but I had to specify
> PCRE_NO_START_OPTIMIZE for most of the patterns to work.


There was a bug in 8.00, unrelated to Unicode, that could cause this. It
was fixed in 8.01.

> No, it should not match \pZ, that would be against Unicode standard. But then
> \s check should be made not equivalent to \pZ, but rather something like "\pZ
> or isspace()"


OK, but when I do get round to this, I will check to see what Perl does.

Philip

--
Philip Hazel