https://bugs.exim.org/show_bug.cgi?id=2998
Bug ID: 2998
Summary: utf8clean should mask surrogate code points (U+D800 to U+DFFF)
Product: Exim
Version: N/A
Hardware: All
OS: All
Status: NEW
Severity: bug
Priority: medium
Component: String expansion
Assignee: unallocated@???
Reporter: jasen@???
CC: exim-dev@???
Created attachment 1444
  --> https://bugs.exim.org/attachment.cgi?id=1444&action=edit
Patch to fix.
Unicode code points U+D800 to U+DFFF only have meaning in UTF-16, where they
are used in surrogate pairs. They should not appear in UTF-8 (see
https://www.ietf.org/rfc/rfc2279.txt section 2).
To this end, the ${utf8clean:<string>} expansion operator should mask them.
Patch attached.
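For illustration only, here is a minimal sketch of how surrogate code points can be recognised in a UTF-8 byte stream. It is not the attached patch and not Exim's code: the names is_utf8_surrogate and mask_surrogates are hypothetical, and writing one mask byte per input byte is an assumption made to keep the example short. The check relies on the fact that a code point in U+D800 to U+DFFF, if encoded as UTF-8, always takes the three-byte form ED A0..BF 80..BF.

```c
#include <stddef.h>
#include <string.h>

/* Return 1 if the bytes at p start a 3-byte UTF-8 sequence that encodes a
   code point in U+D800..U+DFFF, i.e. ED A0..BF 80..BF. */
static int is_utf8_surrogate(const unsigned char *p, size_t remaining)
{
    return remaining >= 3
        && p[0] == 0xED
        && p[1] >= 0xA0 && p[1] <= 0xBF
        && p[2] >= 0x80 && p[2] <= 0xBF;
}

/* Hypothetical helper: overwrite every encoded surrogate in buf with three
   copies of mask, in place, and return how many were masked.
   This sketch handles surrogates only; other invalid sequences are left
   untouched. */
static size_t mask_surrogates(unsigned char *buf, size_t len,
                              unsigned char mask)
{
    size_t count = 0;
    size_t i = 0;

    while (i < len) {
        if (is_utf8_surrogate(buf + i, len - i)) {
            memset(buf + i, mask, 3);
            i += 3;
            count++;
        } else {
            i++;
        }
    }
    return count;
}
```

Note that ED 9F BF (U+D7FF) and EE 80 80 (U+E000), the neighbours on either side of the surrogate range, are left alone by this check.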