[pcre-dev] [Bug 2670] New: \R erroneous match against partic…

Author: admin
Date:  
To: pcre-dev
New-Topics: [pcre-dev] [Bug 2670] \R erroneously matches particular UTF-8 characters 0x{C3, C4, C5, ...}85
Subject: [pcre-dev] [Bug 2670] New: \R erroneous match against particular UTF-8 character 0xC485
https://bugs.exim.org/show_bug.cgi?id=2670

            Bug ID: 2670
           Summary: \R erroneous match against particular UTF-8 character
                    0xC485
           Product: PCRE
           Version: N/A
          Hardware: x86
                OS: Windows
            Status: NEW
          Severity: bug
          Priority: medium
         Component: Code
          Assignee: Philip.Hazel@???
          Reporter: laimonas.vebra@???
                CC: pcre-dev@???


preg_match("/\R/", 'ąbb'); // ą – 0xC485 (UTF-8)
preg_split("/\R/", 'ąbb');
// (and other preg_* functions using \R)

echo -e "\xC4\x85" | LANG=C grep -nP '\R'

Reproduced on Windows (Cygwin) and Linux, with several different versions of PCRE.
'\R' does not match other UTF-8 characters (at least the dozens I have tried), only this one.

Yes, I'm aware of the /u modifier (for preg_*), and matching works normally with
UTF-8 locales, but this still seems like a bug.
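
For anyone trying to reproduce this, here is a minimal shell sketch of the byte-level view. It assumes GNU grep built with -P support. The helper names (sample, sample_hex, count_matches) are made up for illustration, and the C.UTF-8 locale is only an assumption about what is installed. It may also be relevant that the PCRE pattern documentation lists 0x85 (NEL) among the characters \R matches by default, and 0x85 is the second byte of the UTF-8 encoding of 'ą'.

```shell
# Sample line: 'ą' (U+0105) as UTF-8 bytes C4 85, followed by "bb".
# Octal escapes are used because POSIX printf does not guarantee \x.
sample() { printf '\304\205bb\n'; }

# Hex dump of the sample bytes with od's spacing removed.
sample_hex() { sample | od -An -tx1 | tr -d ' \n'; }

# Count lines where \R matches, with grep's locale set to the one given.
# LC_ALL is placed on grep itself so it affects matching, not the producer.
# "|| true" keeps a zero-match exit status from aborting "set -e" callers.
count_matches() { sample | LC_ALL="$1" grep -cP '\R' || true; }

sample_hex; echo        # c48562620a
count_matches C         # byte mode: the case described in this report
count_matches C.UTF-8   # UTF-8 mode, assuming this locale is installed
```

Going by the report, the first count should be 1 and the second 0. Neither is asserted here, since both depend on the local grep and PCRE build, and an unavailable locale may silently fall back to C.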

--
You are receiving this mail because:
You are on the CC list for the bug.