[pcre-dev] [Bug 2625] Unexpected caseless matching of ASCII …

Top Page

Reply to this message
Author: admin
Date:  
To: pcre-dev
Subject: [pcre-dev] [Bug 2625] Unexpected caseless matching of ASCII "s" when using "[\x{00FF}-\x{FFEE}]" in UTF-16 text
https://bugs.exim.org/show_bug.cgi?id=2625

--- Comment #2 from Rich Siegel <siegel@???> ---
(In reply to Zoltan Herczeg from comment #1)
> This is expected. Please check:
> http://www.unicode.org/Public/12.1.0/ucd/CaseFolding.txt


Thank you! I'm a little ashamed to admit that Unicode case folding didn't even
occur to me. This may pose more of a UI and/or documentation challenge, since
I'm quite certain that it won't occur to end users either. :-)

Does it make sense to consider a flag in the PCRE2_EXTRA* space which would
limit case folding to the ASCII range when PCRE2_CASELESS is specified? (I'm
not yet advocating for it; I can see some clear limitations and disadvantages,
and trying to express all of the possible variations could rapidly turn into a
snake pit.)

--
You are receiving this mail because:
You are on the CC list for the bug.