[pcre-dev] [Bug 1437] Using PCRE-8.34 on x86-64 Linux with …

Author: Zoltan Herczeg
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 1437] Using PCRE-8.34 on x86-64 Linux with --enable-jit and --enable-utf, grep -iP '^S' gets stuck on a binary file consuming a lot of CPU for many seconds
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1437

Zoltan Herczeg <hzmester@???> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |hzmester@???





--- Comment #1 from Zoltan Herczeg <hzmester@???> 2014-01-23 00:11:06 ---
Thanks for the bug report. I tried your pattern and input on the latest trunk,
and matched /^S/ against your input 100000 times. The JIT took 0.16 sec; the
interpreter took 1.14 sec.

Am I right that you are using PCRE-8.33? That is a rather old revision, and
several improvements have been added since then (e.g. 'S' has three case
variants, and matching such letters was improved recently). Furthermore, your
input is not a valid UTF-8 stream, and the matching behaviour is undefined in
such cases.
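
To check whether a given buffer really is valid UTF-8, a small standalone
validator can be run over it before matching. The sketch below is not PCRE's
own check; first_invalid_utf8 is a hypothetical helper, written here on the
assumption that the well-formedness rules of RFC 3629 are the ones that matter
(no overlong forms, no surrogates, nothing above U+10FFFF).

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not part of PCRE or grep.
   Returns the offset of the first byte that starts an ill-formed UTF-8
   sequence, or -1 if the whole buffer is well-formed per RFC 3629. */
long first_invalid_utf8(const unsigned char *buf, size_t len)
{
    size_t i = 0;

    assert(buf != NULL || len == 0);

    while (i < len) {
        unsigned char c = buf[i];
        size_t need, k;                      /* continuation bytes expected */
        unsigned char lo = 0x80, hi = 0xBF;  /* range of 1st continuation  */

        if (c <= 0x7F) {                     /* plain ASCII */
            i++;
            continue;
        } else if (c >= 0xC2 && c <= 0xDF) { /* 2-byte sequence */
            need = 1;
        } else if (c >= 0xE0 && c <= 0xEF) { /* 3-byte sequence */
            need = 2;
            if (c == 0xE0) lo = 0xA0;        /* reject overlong forms */
            if (c == 0xED) hi = 0x9F;        /* reject surrogates     */
        } else if (c >= 0xF0 && c <= 0xF4) { /* 4-byte sequence */
            need = 3;
            if (c == 0xF0) lo = 0x90;        /* reject overlong forms */
            if (c == 0xF4) hi = 0x8F;        /* reject > U+10FFFF     */
        } else {
            return (long)i;                  /* 0x80..0xC1, 0xF5..0xFF */
        }

        if (len - i <= need)                 /* sequence is truncated */
            return (long)i;
        if (buf[i + 1] < lo || buf[i + 1] > hi)
            return (long)i;
        for (k = 2; k <= need; k++)
            if ((buf[i + k] & 0xC0) != 0x80)
                return (long)i;

        i += need + 1;
    }
    return -1;
}
```

A return value other than -1 gives the byte offset where the buffer stops
being valid UTF-8, which can then be compared with the point where the match
appears to get stuck.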

Would it be possible to dump the input of the pcre_exec calls in grep? I
suspect something is wrong (the 'size' argument, for example) with some of the
input buffers.
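
A minimal sketch of such a dump, assuming a debug build of grep can be patched
at the call site: in PCRE 8.x, pcre_exec receives the subject pointer, its
length in bytes and a start offset, so those three values are what is worth
logging. format_subject_dump is a hypothetical helper, not something grep or
PCRE provides.

```c
#include <assert.h>
#include <stdio.h>

/* Hypothetical helper, not part of PCRE or grep.
   Formats "length=<n> start=<off>:" followed by the subject bytes in hex
   into out. Returns the number of characters written (excluding the
   terminating NUL), or -1 if out is too small. */
int format_subject_dump(char *out, size_t out_size,
                        const char *subject, int length, int start_offset)
{
    int used, i;

    assert(out != NULL && subject != NULL && length >= 0);

    used = snprintf(out, out_size, "length=%d start=%d:",
                    length, start_offset);
    if (used < 0 || (size_t)used >= out_size)
        return -1;

    for (i = 0; i < length; i++) {
        size_t room = out_size - (size_t)used;
        int n = snprintf(out + used, room, " %02x",
                         (unsigned int)(unsigned char)subject[i]);
        if (n < 0 || (size_t)n >= room)
            return -1;
        used += n;
    }
    return used;
}
```

Calling this just before each pcre_exec call with the same subject, length and
start offset, and printing the result to stderr, would show whether a length
larger than the real buffer (or a buffer of invalid UTF-8) is being passed.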


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email