Re: [pcre-dev] PCRE with UTF-8

Top Page
Delete this message
Author: Manohar S
Date:  
To: pcre-dev
Subject: Re: [pcre-dev] PCRE with UTF-8
hi sorry for the very late reply.
The pattern is as you have given earlier..
pattern = [\'\"][\x{80}-\x{ffff}a-zA-Z0-9]+[\'\"];
String = "select * from account where a = 'ਠਡਢಉಉವಷಡಢಣತಥವಷಡಢಣತಥ';"

and string actually read from a file and it will be read in UTF-8 form..
then i am doin pcre_compile and pcre_exec with PCRE_UTF8.
the RC got is 1, but ovector[0] and ovector[1] are filled with 0 :(..

also I tried with this [\200-\377]* (Mr. Zack gave this to me.. decimal
range) but i got exactly same result.

hope this helps for reproducing and waiting for your reply,
Manohar.s

On Sat, Jun 14, 2008 at 4:56 PM, Philip Hazel <ph10@???> wrote:

> On Fri, 13 Jun 2008, Manohar S wrote:
>
> > With this, I am having a successful regex match (matched count = 1), but
> my
> > offset vector always shows 0.. is there any bug with this?
>
> What is the pattern?
> What is the data string?
>
> --
> Philip Hazel
>