[pcre-dev] [Bug 2555] Slowdown in 16/32 bit compiling

Top Page
Delete this message
Author: admin
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 2555] Slowdown in 16/32 bit compiling
https://bugs.exim.org/show_bug.cgi?id=2555

--- Comment #5 from Giuseppe D'Angelo <dangelog@???> ---
(In reply to Philip Hazel from comment #4)
> The input to pcre2test is always 8-bit. If the utf option is set, that
> pattern and its test data are interpreted as UTF-8 characters. If you set
> the -16 or -32 options, the characters that are defined in UTF-8 are
> converted to UTF-16 or UTF-32 before being passed to the relevant PCRE2
> library. Thus, the same input works for all 3 libraries.


OK, thanks for confirming. Then indeed the original attachment is OK (it's the
UTF-16 pattern I had from fuzzing, converted and saved in UTF-8).

> This particular pattern fails fast in 8-bit mode because it is too big. If
> you compile --with-link-size=3 the 8-bit pattern behaves in the same way as
> the 16- and 32-bit patterns - that is, it is accepted but JIT takes a long
> time.


Great catch! I didn't think of trying a different link size.

So, given the above: is it deemed a problem or not that merely (JIT)compiling a
pattern can take a long time? Is the user expected to never compile/validate an
untrusted regexp?

--
You are receiving this mail because:
You are on the CC list for the bug.