[pcre-dev] [Bug 1419] PCRE slower at matching UTF8 character…

Author: Philip Hazel
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 1419] PCRE slower at matching UTF8 character classes.

http://bugs.exim.org/show_bug.cgi?id=1419




--- Comment #10 from Philip Hazel <ph10@???> 2013-12-15 15:32:50 ---
On Sun, 15 Dec 2013, Zoltan Herczeg wrote:

> I commented out the following lines in pcre_exec:
>
>           if (utf)
>             ACROSSCHAR(start_match < end_subject, *start_match,
>               start_match++);
>
> All regression tests pass (not surprisingly, since JIT has always worked this
> way), and performance is now similar in the first two cases even in the
> interpreter (without the UTF check).


I tried that patch on this ancient x86 box, and it did not seem to make
much difference - the first two cases still perform differently.
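
For anyone following along, the quoted lines appear to move start_match
forward past any UTF-8 continuation bytes (those of the form 10xxxxxx) so
that the next match attempt begins at the start of a character. The
following is only a rough sketch of that idea for 8-bit subjects, not the
PCRE source: the function name skip_continuation_bytes is hypothetical, and
the assumption that ACROSSCHAR tests (byte & 0xc0) == 0x80 should be
checked against pcre_internal.h.

```c
/* Hypothetical helper, not part of PCRE: sketch of what the quoted
   ACROSSCHAR call is assumed to do in 8-bit UTF-8 mode. */
static const unsigned char *
skip_continuation_bytes(const unsigned char *p, const unsigned char *end)
{
    /* A UTF-8 continuation byte has its top two bits set to 10. */
    while (p < end && (*p & 0xc0) == 0x80)
        p++;
    /* p now points at the first byte of a character, or at end. */
    return p;
}
```

With the two-byte sequence 0xC3 0xA9 followed by 'a', a pointer left on the
0xA9 byte would be moved on to 'a', while a pointer already on 0xC3 would
be left alone.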

But as you say, work on optimizing this must wait till after 8.34 is
released ... I am already running final checks and tests.

Regards,
Philip

