[pcre-dev] [Bug 897] \w and others based on Unicode properti…

Author: Philip Hazel
Date:  
To: pcre-dev
Old-Topics: [pcre-dev] [Bug 897] New: \w and others based on Unicode properties
Subject: [pcre-dev] [Bug 897] \w and others based on Unicode properties

http://bugs.exim.org/show_bug.cgi?id=897

Philip Hazel <ph10@???> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |FIXED

--- Comment #23 from Philip Hazel <ph10@???> 2010-05-18 16:51:16 ---
I have just committed a patch that adds a new option, PCRE_UCP, to provide the
facility you want. I did it in a completely different way from what we
previously discussed. I added new artificial properties to match "words" and
other things, and then, when PCRE_UCP is set, I made pcre_compile() replace \w,
\d, etc. with \p or \P, using these new properties. This works well, is a much
smaller change to the code, and does not affect performance when PCRE_UCP is
not set.
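
To illustrate the substitution idea described above, here is a minimal
sketch in Python. It is not the committed patch and does not reproduce the
code in pcre_compile(). The function name rewrite_escapes and the table
UCP_REWRITES are hypothetical, and the property names (Xwd, Xsp, Nd) are an
assumption taken from the special properties listed in PCRE's pattern
documentation; this message does not name them.

```python
import re

# Hypothetical mapping, for illustration only. The property names are an
# assumption based on PCRE's documented special properties; the message
# above does not say which names the patch uses.
UCP_REWRITES = {
    "w": r"\p{Xwd}",
    "W": r"\P{Xwd}",
    "d": r"\p{Nd}",
    "D": r"\P{Nd}",
    "s": r"\p{Xsp}",
    "S": r"\P{Xsp}",
}

# Consume every backslash escape as a two-character unit, so that an
# escaped backslash followed by a literal 'w' is not mistaken for \w.
_ESCAPE = re.compile(r"\\(.)", re.DOTALL)


def rewrite_escapes(pattern: str, ucp: bool) -> str:
    """Return pattern with \\w, \\d, \\s (and negations) replaced by \\p/\\P.

    When ucp is False the pattern is returned untouched, which mirrors the
    point that nothing changes unless the option is set.
    """
    if not ucp:
        return pattern

    def substitute(match: re.Match) -> str:
        # Unknown escapes (e.g. \b or \\) are passed through unchanged.
        return UCP_REWRITES.get(match.group(1), match.group(0))

    return _ESCAPE.sub(substitute, pattern)


if __name__ == "__main__":
    print(rewrite_escapes(r"\w+\s\d", ucp=True))
    print(rewrite_escapes(r"\w+\s\d", ucp=False))
```

The sketch ignores details a real compiler must handle, such as \Q...\E
quoting, so it should be read only as a picture of the "rewrite, then
compile as usual" approach.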

