Re: [pcre-dev] PCRE UTF8 umlauts CASELESS matching

Author: Philip Hazel
Date:  
To: Enno
CC: pcre-dev
Subject: Re: [pcre-dev] PCRE UTF8 umlauts CASELESS matching
On Thu, 1 Nov 2007, Enno wrote:

> PCRE version 6.7 04-Jul-2006


That's quite old. Current release is 7.4 (September 2007). My testing
using pcretest shows that this works:

  re> /\x{c4}/8i
data> \x{c4}
 0: \x{c4}
data> \x{e4}
 0: \x{e4}

That is using the hex escapes for A-umlaut (U+00C4) and a-umlaut
(U+00E4). It also works with the literal characters in place of the
escapes, provided that both the pattern and the subject are UTF-8
strings. That is, A-umlaut must be encoded as the two bytes C3 84 and
a-umlaut as C3 A4.
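
The byte values above can be checked with a short sketch. It uses Python's standard `re` module as a stand-in, not PCRE itself, and the helper names (`utf8_hex`, `caseless_match`, `caseless_match_bytes`) are made up for illustration. It is meant only to show the UTF-8 encodings and why caseless matching needs the engine to treat the data as characters rather than raw bytes.

```python
import re

A_UMLAUT = "\u00c4"  # A-umlaut, U+00C4
a_umlaut = "\u00e4"  # a-umlaut, U+00E4


def utf8_hex(ch: str) -> str:
    """Return the UTF-8 encoding of ch as space-separated hex bytes."""
    return " ".join(f"{b:02X}" for b in ch.encode("utf-8"))


def caseless_match(pattern: str, subject: str) -> bool:
    """Caseless match on text strings (character-aware, like UTF-8 mode)."""
    return re.fullmatch(pattern, subject, re.IGNORECASE) is not None


def caseless_match_bytes(pattern: str, subject: str) -> bool:
    """Caseless match on the raw UTF-8 bytes, with no character awareness.

    For byte patterns, Python's re.IGNORECASE folds only ASCII letters,
    so the two-byte sequences for the umlauts are compared byte by byte.
    """
    return (
        re.fullmatch(
            pattern.encode("utf-8"), subject.encode("utf-8"), re.IGNORECASE
        )
        is not None
    )


if __name__ == "__main__":
    print(utf8_hex(A_UMLAUT))   # C3 84
    print(utf8_hex(a_umlaut))   # C3 A4
    print(caseless_match(A_UMLAUT, a_umlaut))        # True
    print(caseless_match_bytes(A_UMLAUT, a_umlaut))  # False
```

The last line shows the failure mode to expect if the data is compared as plain bytes: C3 84 and C3 A4 differ in the second byte, and a byte-oriented caseless comparison has no rule that relates them.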

Philip

--
Philip Hazel