[pcre-dev] [Bug 1100] Crash backtracking over unicode sequen…

Author: Tom Hughes
Date:  
To: pcre-dev
Old-Topics: [pcre-dev] [Bug 1100] New: Crash backtracking over unicode sequence
Subject: [pcre-dev] [Bug 1100] Crash backtracking over unicode sequence
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1100




--- Comment #9 from Tom Hughes <tom@???> 2011-07-19 11:12:22 ---
We were actually using \X as a convenient approximation for an extended
grapheme cluster ;-)

Our use case is that our user interface is defined in terms of characters and
when moving to Unicode we decided that an extended grapheme cluster was the
appropriate thing to consider as being a character.

So when we preprocess user-supplied regular expressions (which use a 20+ year
old syntax) into PCRE regular expressions, we substitute "\X" for "." in an
attempt to make "." match a character, aka an extended grapheme cluster.
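
A rough sketch of that kind of substitution, for illustration only. This is
not the actual preprocessor described here: the function name is
hypothetical, and the rules it assumes (a backslash escapes the next
character, "[" ... "]" delimits a character class in which "." is literal)
are assumptions about the legacy syntax, which the comment does not spell out.

```python
def dots_to_grapheme_clusters(pattern: str) -> str:
    # Hypothetical helper: rewrite each unescaped "." that is outside a
    # character class as PCRE's \X. Assumed (not confirmed) syntax rules:
    # backslash escapes the next character, and "[" ... "]" is a class.
    out = []
    in_class = False
    i = 0
    while i < len(pattern):
        ch = pattern[i]
        if ch == "\\" and i + 1 < len(pattern):
            # Copy an escaped pair such as "\." through unchanged.
            out.append(pattern[i:i + 2])
            i += 2
            continue
        if in_class:
            # Inside a class every character, including ".", is kept as-is.
            # Simplification: a "]" placed first in a class is not special-cased.
            if ch == "]":
                in_class = False
            out.append(ch)
        elif ch == "[":
            in_class = True
            out.append(ch)
        elif ch == ".":
            out.append("\\X")
        else:
            out.append(ch)
        i += 1
    return "".join(out)


if __name__ == "__main__":
    print(dots_to_grapheme_clusters("a.c"))
    print(dots_to_grapheme_clusters(r"a\.c"))
```

The rewritten pattern would then be handed to PCRE compiled with Unicode
property support, since \X depends on it.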


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email