[pcre-dev] [Bug 1100] Crash backtracking over unicode sequen…

トップ ページ
このメッセージを削除
著者: Tom Hughes
日付:  
To: pcre-dev
古いトピック: [pcre-dev] [Bug 1100] New: Crash backtracking over unicode sequence
題目: [pcre-dev] [Bug 1100] Crash backtracking over unicode sequence
------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1100




--- Comment #9 from Tom Hughes <tom@???> 2011-07-19 11:12:22 ---
We were actually using \X as a convenient approximation for extended grapheme
cluster ;-)

Our use case is that our user interface is defined in terms of characters and
when moving to Unicode we decided that an extended grapheme cluster was the
appropriate thing to consider as being a character.

So when we preprocess user supplied regular expressions (which use a 20+ year
old syntax) into PCRE regular expressions we substitute "\X" for "." in an
attempt to make "." match a character, aka an extended grapheme cluster.


--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email