[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Startseite
Nachricht löschen
Autor: Pavel Kostromitinov
Datum:  
To: pcre-dev
Alte Treads: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Betreff: [pcre-dev] [Bug 897] \w and others based on Unicode properties
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=897




--- Comment #16 from Pavel Kostromitinov <pavel@???> 2010-03-19 13:20:04 ---
Is there any hope that patches for this feature will make their way into main
pcre version? I guess there'll be more people who could appreciate this.

Though there are still some problems I could not solve.

For example, the problem with [\W] - I still cannot figure out how to implement
it.
[^\pL\pN] can not be used, since the set is either inclusive or exclusive - and
if I have [\Wa-f] as input, I cannot convert it ti something
like[(^\pL\pN)a-f]...

And there are problem with pcre_study.c.

Also - there should be some hack for \t to be treated as \s (it is NOT \pZ)


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email