[pcre-dev] [Bug 712] \w class not matched few hungarian char…

Author: Philip Hazel
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 712] \w class not matched few hungarian characters
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=712

Philip Hazel <ph10@???> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|                            |INVALID





--- Comment #1 from Philip Hazel <ph10@???> 2008-05-19 17:21:34 ---
This is a quote from the pcrepattern man page:

"In UTF-8 mode, characters with values greater than 128 never match \d, \s, or
\w, and always match \D, \S, and \W. This is true even when Unicode
character property support is available. These sequences retain their original
meanings from before UTF-8 support was available, mainly for efficiency
reasons."

In other words, this is not a bug. If you want to test for properties of such
characters, you must use the \p escapes. Note that a bug concerning character
properties was fixed in release 7.5. The current release of PCRE is 7.7.
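
To illustrate the distinction, here is a minimal sketch in Python rather than PCRE itself. It rests on two assumptions: Python's re module with the re.ASCII flag is used as a stand-in for the ASCII-only \w described in the man page quote, and, because the standard re module has no \p escapes, a hypothetical helper is_letter() approximates \p{L} by checking the Unicode general category. The sample text is made up for the example.

```python
import re
import unicodedata

# Sample text (made up for this sketch): the Hungarian letters
# o-double-acute, u-double-acute and a-acute, then ASCII "x", "7", "_".
TEXT = "\u0151 \u0171 \u00e1 x 7 _"

# Assumption: re.ASCII makes Python's \w ASCII-only, which is comparable
# to the PCRE behaviour quoted from the pcrepattern man page.
ascii_word = re.compile(r"\w", re.ASCII)


def is_letter(ch: str) -> bool:
    # Hypothetical stand-in for PCRE's \p{L}: true when the Unicode
    # general category is one of the letter categories (Lu, Ll, Lt, Lm, Lo).
    return unicodedata.category(ch).startswith("L")


# ASCII-only \w skips the accented letters entirely.
ascii_matches = ascii_word.findall(TEXT)

# The property-based test accepts them, but not the digit or underscore.
letters = [ch for ch in TEXT if is_letter(ch)]

print(ascii_matches)  # ['x', '7', '_']
print(letters)        # ['ő', 'ű', 'á', 'x']
```

Note that the two tests are not interchangeable: the property check accepts only letters, whereas \w also accepts digits and the underscore, so a full replacement for \w would need to combine several properties.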


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email