[pcre-dev] [Bug 1208] Case folding in PCRE

Author: Zoltan Herczeg
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 1208] Case folding in PCRE

http://bugs.exim.org/show_bug.cgi?id=1208




--- Comment #5 from Zoltan Herczeg <hzmester@???> 2012-02-09 17:50:37 ---
We already have such an opcode: \R, which matches (?:\r|\n|\r\n), although it
cannot be used inside [] character classes. If the number of such letters is
relatively low (fewer than 5, for example), we might be able to introduce
special opcodes for them. (Are there any others besides Eszett?) The remaining
cases would be implemented as character ranges, similar to \h and \v, but that
list is long and would require manual maintenance... Anyway, if we decide to
add this multiple-case handling, \h and \v should become part of the new system.
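
To illustrate why a multi-character sequence such as \r\n does not fit inside a
[] class, here is a rough sketch in Python. Python's re module is not PCRE and
has no \R, so the alternation below is only an assumed stand-in restricted to
CR, LF and CRLF; the names NEWLINE_SEQ and NEWLINE_CLASS are made up for this
example.

```python
import re

# Assumed stand-in for \R limited to CR, LF and CRLF.
# CRLF is listed first so the two-character sequence is tried before a lone CR.
NEWLINE_SEQ = re.compile(r"(?:\r\n|\r|\n)")

# A character class always consumes exactly one character per match,
# so it can never treat CRLF as a single unit.
NEWLINE_CLASS = re.compile(r"[\r\n]")

text = "a\r\nb\nc\rd"

seq_matches = NEWLINE_SEQ.findall(text)
class_matches = NEWLINE_CLASS.findall(text)

print(seq_matches)    # CRLF is reported as one match
print(class_matches)  # CR and LF of the CRLF pair are reported separately
```

The same limitation would apply to a letter whose case-insensitive equivalent
is more than one character long: it can be handled by a dedicated opcode or an
alternation, but not as a plain member of a character range.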

By the way, this would surely be unexpected for an average PCRE user:
- "Μ" (uppercase mu) matches "μ" (lowercase mu);

Or does uppercase mu have a different code point, like the different beta
characters?
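
One way to check the code points in question is a small Python sketch like the
one below. It relies on Python's str.casefold(), which follows Unicode full
case folding and may not agree with PCRE's own tables; the helper name
fold_points and the choice of sample characters are assumptions made for this
example only.

```python
import unicodedata

def fold_points(ch):
    """Return the code points that ch maps to under Unicode full case folding."""
    return [ord(c) for c in ch.casefold()]

# Sample characters: capital mu, small mu, micro sign,
# small beta, beta symbol, and Eszett (sharp s).
samples = ["\u039c", "\u03bc", "\u00b5", "\u03b2", "\u03d0", "\u00df"]

for ch in samples:
    folded = " ".join(f"U+{cp:04X}" for cp in fold_points(ch))
    print(f"U+{ord(ch):04X} {unicodedata.name(ch)} -> {folded}")
```

Under these rules capital mu, small mu and the micro sign are three distinct
code points that all fold to small mu, the beta symbol folds to small beta,
and Eszett folds to the two-character sequence "ss".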

