Re: [pcre-dev] Property Codes, Character Classes and non-UTF…

Inizio della pagina
Delete this message
Autore: Sheri
Data:  
To: pcre-dev
Oggetto: Re: [pcre-dev] Property Codes, Character Classes and non-UTF8 mode
Philip Hazel wrote:
> On Sat, 25 Oct 2008, Sheri wrote:
>
>
>> We've previously established and documented that Unicode property codes
>> do work in non-UTF8 mode for characters up to 255.
>>
>
> Presumably using Unicode encoding?
>

ANSI

"The ANSI character set, also known as Windows-1252, has become a
Microsoft proprietary character set; it is a superset of ISO-8859-1 with
the addition of 27 characters in locations that ISO designates for
control codes."

http://www.alanwood.net/demos/charsetdiffs.html
>
>> But the documentation says that \p and \P can be used in character
>> classes. In character classes, they seem to work only up to 128. Bug?
>>
>
> It would seem so. The behaviour should obviously be the same in and
> outside character classes.
>
> I've put this on my list to look at when I next work on PCRE. Thanks for
> the report.
>

Thank you!
Sheri