[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Top Page
Delete this message
Author: Pavel Kostromitinov
Date:  
To: pcre-dev
Old-Topics: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Subject: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #22 from Pavel Kostromitinov <pavel@???> 2010-03-23 12:06:09 ---
> > No, there was something (if I remember correctly) about pcre_study making a
> > bitmask of what characters expected at the start of pattern, and it was
> > suitable only for 256 chars.
>
> I've made a note to check that out, but my recollection is that the
> bitmask is of which BYTES are expected at the start of the pattern.
>

I meant that filling it with correct values if \w is at the start of a pattern
becomes more complicated.

I should rephrase - not that there are problems with existing pcre_study()
code, but rather I had problems modifying it for new \w etc handling.


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email