[pcre-dev] PCRE and UTF-8

Author: Ondrej Hoferek
Date:
To: pcre-dev
Subject: [pcre-dev] PCRE and UTF-8

Hello,

I am using the pcre library and have a small problem with the utf-8
encoded strings. When retrieving the information about the beginning and
end of the matched part of utf-8 encoded subject, I get wrong numbers.
Every symbol that is encoded in more than one byte is counted as that
many symbols as how many bytes it is encoded in. Do you also observe
this problem or have any idea why this happens to me? Thanks for any
advice.

Best regards,
Ondrej Hoferek

This message is part of the following thread:
	the complete thread tree sorted by date

	Philip Hazel at