On Sat, 5 Oct 2013, Thierry Douez wrote:
> and trying
> "(*UTF8)\w+(\s+)"
> "(*UTF)\w+(\s+)"
> or "(*UCP)\w+(\s+)"
> doesn't match anything on "abcdéfgh: 55"
You need "(*UTF8)(*UCP)\w+(\s+)" that is, you need to tell PCRE *two*
things:
(1) The text strings are UTF
(2) Use Unicode properties for \w, \s, etc
Philip
--
Philip Hazel