[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Startseite
Nachricht löschen
Autor: Philip Hazel
Datum:  
To: pcre-dev
Alte Treads: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Betreff: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #19 from Philip Hazel <ph10@???> 2010-03-22 16:31:34 ---
On Mon, 22 Mar 2010, Pavel Kostromitinov wrote:

> > > And there are problem with pcre_study.c.
> > What problems?
> Frankly speaking, I don't remember exactly already, but I had to specify
> PCRE_NO_START_OPTIMIZE for most of the patterns to work.


There was a bug in 8.00, unrelated to Unicode, that could cause this. It
was fixed in 8.01.

> No, it should not match \pZ, that would be against Unicode standard. But then
> \s check should be made not equivalent to \pZ, but rather something like "\pZ
> or isspace()"


OK, but when I do get round to this, I will check to see what Perl does.

Philip


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email