[pcre-dev] [Bug 1388] PCRE_UCP seems to ruin Callout pattern…

Author: Philip Hazel
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 1388] PCRE_UCP seems to ruin Callout pattern indices

http://bugs.exim.org/show_bug.cgi?id=1388

--- Comment #1 from Philip Hazel <ph10@???> 2013-09-28 18:51:20 ---
On Sat, 28 Sep 2013, firas wrote:

> This is my first bug report so I hope I am doing this correctly.


Seems good to me.

> When I use the automatic callout feature (PCRE_AUTO_CALLOUT) in conjunction
> with PCRE_UCP I get weird values for the pattern position and length from each
> callout block.
>
> For example, the regex `\w` with the options mentioned above ran on the string
> `Ƶ` returns the following indices:
> Start: 1
> End: 26190
>
> Start: 26191
> End: 39348


I have had a quick look, using pcretest, and indeed, there is something
weird going on with the example you give, and PCRE_UCP seems to trigger
it. It also seems there are no tests that use both PCRE_UCP and
PCRE_AUTO_CALLOUT.
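
For anyone wanting to add such a test, here is a minimal sketch of the
kind of range check a callout handler could apply. It assumes the two
values in question are the pattern_position and next_item_length fields
of the PCRE 8.x pcre_callout_block, and that a sane callout always points
inside the pattern. The function names are hypothetical and are not part
of PCRE; the sketch deliberately does not call the library, so it only
illustrates the check itself.

```c
#include <assert.h>
#include <stddef.h>

/*
 * Hypothetical helper, not part of PCRE.
 *
 * Returns 1 if a callout's (position, item_length) pair lies entirely
 * inside a pattern that is pattern_len code units long, otherwise 0.
 * In PCRE 8.x the two values would come from the pattern_position and
 * next_item_length fields of pcre_callout_block (an assumption about
 * which fields the report refers to).
 */
int callout_span_is_sane(size_t pattern_len, int position, int item_length)
{
    if (position < 0 || item_length < 0)
        return 0;                       /* negative values are never valid */
    if ((size_t)position > pattern_len)
        return 0;                       /* starts beyond the end of the pattern */
    /* The item must end at or before the end of the pattern. */
    return (size_t)item_length <= pattern_len - (size_t)position;
}

/*
 * Hypothetical wrapper for use inside a test harness's callout handler:
 * aborts as soon as an out-of-range pair is seen.
 */
void callout_span_assert_sane(size_t pattern_len, int position, int item_length)
{
    assert(callout_span_is_sane(pattern_len, position, item_length));
}
```

The pattern \w is only two characters long, so values such as 26190 or
39348 fail this check however the reported Start/End numbers map onto
the two fields.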

This bug is now on the list of things to be fixed in due course, and
certainly before the next release. And of course the tests will be
updated...

Thank you for the report.

Philip

