[pcre-dev] [Bug 1100] Crash backtracking over unicode sequen…

Góra strony
Delete this message
Autor: Tom Hughes
Data:  
Dla: pcre-dev
Stare tematy: [pcre-dev] [Bug 1100] New: Crash backtracking over unicode sequence
Temat: [pcre-dev] [Bug 1100] Crash backtracking over unicode sequence
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1100




--- Comment #9 from Tom Hughes <tom@???> 2011-07-19 11:12:22 ---
We were actually using \X as a convenient approximation for extended grapheme
cluster ;-)

Our use case is that our user interface is defined in terms of characters and
when moving to Unicode we decided that an extended grapheme cluster was the
appropriate thing to consider as being a character.

So when we preprocess user supplied regular expressions (which use a 20+ year
old syntax) into PCRE regular expressions we substitute "\X" for "." in an
attempt to make "." match a character, aka an extended grapheme cluster.


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email