[pcre-dev] Property Codes, Character Classes and non-UTF8 mo…

Top Page
Delete this message
Author: Sheri
Date:  
To: pcre-dev
New-Topics: Re: [pcre-dev] Property Codes, Character Classes and non-UTF8 mode
Subject: [pcre-dev] Property Codes, Character Classes and non-UTF8 mode
We've previously established and documented that Unicode property codes
do work in non-UTF8 mode for characters up to 255.

But the documentation says that \p and \P can be used in character
classes. In character classes, they seem to work only up to 128. Bug?

pcretest
PCRE version 7.8 2008-09-05

re> /(?:\p{Lu}|\x20)+/
data> \x41\x20\x50\xC2\x54\xC9\x20\x54\x4F\x44\x41\x59

0: A P\xc2T\xc9 TODAY
re> /[\p{Lu}\x20]+/
data> \x41\x20\x50\xC2\x54\xC9\x20\x54\x4F\x44\x41\x59

0: A P

Regards,
Sheri