[pcre-dev] /\p{Arabic}/8

Top Page
Delete this message
Author: Juergen Leising
Date:  
To: pcre-dev
Subject: [pcre-dev] /\p{Arabic}/8

Hello,

I wonder, why certain hex codes do not match certain
UTF-8 script names. For example, with pcre-7.4:

re> /^[\p{Arabic}]/8
data> \x{06e9}

0: \x{6e9}
data> \x{0654}

No match
data> \x{0658}

No match
data> \x{0656}

No match


I had a glance at UnicodeData.txt from

    ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.txt


and therefore expected these numbers to match:

0654;ARABIC HAMZA ABOVE;Mn;230;NSM;;;;;N;;;;;
0656;ARABIC SUBSCRIPT ALEF;Mn;220;NSM;;;;;N;;;;;
0658;ARABIC MARK NOON GHUNNA;Mn;230;NSM;;;;;N;;;;;

But they don't. Do I miss something? Wrong table or version or
syntax or whatever?

Bye, bye

Juergen