[pcre-dev] PCRE and UTF-8

Author: Ondrej Hoferek
Date:  
To: pcre-dev
Subject: [pcre-dev] PCRE and UTF-8
Hello,

I am using the PCRE library and have a small problem with UTF-8
encoded strings. When I retrieve the start and end offsets of the
matched part of a UTF-8 encoded subject, the numbers are not what I
expect. Every character that is encoded in more than one byte is
counted once per byte of its encoding instead of as a single
character. Has anyone else seen this, or does anyone know why it
happens? Thanks for any advice.
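
To illustrate the numbers I would expect, here is a minimal sketch,
assuming the offsets I get back are byte positions in the subject. The
helper utf8_chars_before is my own name and not part of PCRE; it also
assumes the subject is valid UTF-8 and that the offset falls on a
character boundary.

```c
#include <stddef.h>

/* Hypothetical helper (not a PCRE function): count the UTF-8
   characters contained in the first byte_offset bytes of subject.
   Assumes valid UTF-8 and an offset on a character boundary. */
size_t utf8_chars_before(const char *subject, size_t byte_offset)
{
    size_t chars = 0;
    for (size_t i = 0; i < byte_offset; i++) {
        /* Continuation bytes have the form 10xxxxxx; any other
           byte starts a new character. */
        if (((unsigned char)subject[i] & 0xC0) != 0x80)
            chars++;
    }
    return chars;
}
```

Passing the start and end offsets of a match through such a function
would give positions counted in characters rather than bytes.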

Best regards,
Ondrej Hoferek