[pcre-dev] PCRE and UTF-8

Páxina inicial
Borrar esta mensaxe
Autor: Ondrej Hoferek
Data:  
Para: pcre-dev
Asunto: [pcre-dev] PCRE and UTF-8
Hello,

I am using the pcre library and have a small problem with the utf-8
encoded strings. When retrieving the information about the beginning and
end of the matched part of utf-8 encoded subject, I get wrong numbers.
Every symbol that is encoded in more than one byte is counted as that
many symbols as how many bytes it is encoded in. Do you also observe
this problem or have any idea why this happens to me? Thanks for any
advice.

Best regards,
Ondrej Hoferek