Hello,
I wonder, why certain hex codes do not match certain
UTF-8 script names. For example, with pcre-7.4:
re> /^[\p{Arabic}]/8
data> \x{06e9}
0: \x{6e9}
data> \x{0654}
No match
data> \x{0658}
No match
data> \x{0656}
No match
I had a glance at UnicodeData.txt from
ftp://ftp.unicode.org/Public/UNIDATA/UnicodeData.txt
and therefore expected these numbers to match:
0654;ARABIC HAMZA ABOVE;Mn;230;NSM;;;;;N;;;;;
0656;ARABIC SUBSCRIPT ALEF;Mn;220;NSM;;;;;N;;;;;
0658;ARABIC MARK NOON GHUNNA;Mn;230;NSM;;;;;N;;;;;
But they don't. Do I miss something? Wrong table or version or
syntax or whatever?
Bye, bye
Juergen