[pcre-dev] [Bug 1231] New: JIT bug with lookbehind assertion…

Top Page
Delete this message
Author: Malcolm Parsons
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 1231] New: JIT bug with lookbehind assertion in UTF8 mode
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1231
           Summary: JIT bug with lookbehind assertion in UTF8 mode
           Product: PCRE
           Version: 8.30
          Platform: Other
        OS/Version: Linux
            Status: NEW
          Severity: bug
          Priority: medium
         Component: Code
        AssignedTo: ph10@???
        ReportedBy: malcolm.parsons@???
                CC: pcre-dev@???



With JIT enabled in UTF8 mode, the regexp
'(?i)(?<!b|th|z|m|p|^)eta(?!\w)' unexpectedly matches ETA:

$ ./pcregrep -V
pcregrep version 8.30 2012-02-04
$ echo ETA | ./pcregrep --no-jit '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep -u --no-jit '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep -u '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
ETA

No change in trunk:

$ ./pcregrep -V
pcregrep version 8.31-RC1 2012-02-012
$ echo ETA | ./pcregrep --no-jit '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep -u --no-jit '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
$ echo ETA | ./pcregrep -u '(?i)(?<!b|th|z|m|p|^)eta(?!\w)'
ETA


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email