Re: [pcre-dev] pcre not matching unicode characters

Author: ph10
Date:  
To: Jayaprakasam, Kannan
CC: pcre-dev@exim.org
Subject: Re: [pcre-dev] pcre not matching unicode characters
On Thu, 17 Oct 2013, Jayaprakasam, Kannan wrote:

> The PCRE README says:
> If you want to make use of the support for UTF-8 Unicode character strings in
> the 8-bit library, or UTF-16 Unicode character strings in the 16-bit library,
> or UTF-32 Unicode character strings in the 32-bit library, you must add
> --enable-utf to the "configure" command.


"You" in that sentence means the person who is building the PCRE
library.

> Without it, the code for handling UTF-8, UTF-16 and UTF-32 is not
> included in the relevant library. Even when --enable-utf is
> included, the use of a UTF encoding still has to be enabled by an
> option at run time.
>
> What is this option at runtime for utf encoding being talked about? I
> could not find any such option in the pcre apis.


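--enable-utf is a *build-time* option. If it is not used, the library
does not support UTF. You can check whether your library was compiled
with UTF support by running

pcretest -C

The run-time option is PCRE_UTF8 (or PCRE_UTF16 or PCRE_UTF32 for the
16-bit and 32-bit libraries). It is passed in the options argument of
pcre_compile().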

Philip

--
Philip Hazel