[pcre-dev] [Bug 1208] Case folding in PCRE

Author: Zoltan Herczeg
Date:
To: pcre-dev
Subject: [pcre-dev] [Bug 1208] Case folding in PCRE

------- You are receiving this mail because: -------
You are on the CC list for the bug.

http://bugs.exim.org/show_bug.cgi?id=1208

--- Comment #5 from Zoltan Herczeg <hzmester@???> 2012-02-09 17:50:37 ---
We already have such opcode: \R which matches to (?:\r|\n|\r\n) although it
cannot be used inside [] ranges. If the number of such letters are relatively
low (<5 for example), we might able to introduce special opcodes for them (Is
there any other besides Eszett?). The other opcodes would be implemented as
character ranges. Similar to \h and \v but the list on the list is long and
would require manual maintenance... Anyway if we decide to add multiple cases
thing \h and \v should become part of that new system.

By the way, this would surely be unexpected for an average PCRE user:
- "Μ" (uppercase mu) matches "μ" (lowercase mu);

Or uppercase mu has a different codepoint like different beta characters?

--
Configure bugmail: http://bugs.exim.org/userprefs.cgi?tab=email

This message is part of the following thread:
	the complete thread tree sorted by date
	Philip Hazel at
	Petr Pisar at