Author: Ze'ev Atlas Date: To: pcre-dev@exim.org Subject: Re: [pcre-dev] issues with EBCDIC and pcretest
Hi Philip
In developing the EBCDIC test suite, I am using this Methodology:1. I am interested only in testinput1, testinput2 and partially in testinput11 and testinput14.2. Compare pcretest results from the standard test suite to the z/OS EBCDIC results (after converting to ASCII - this is all a pain in the neck :)3. Ignore all same results or where the differences are obvious (0x0a vs. 0x15, Caret vs. logical not) and where the EBCDIC results are matching correctly, if different.4. For all remaining issues, I've developed a small test suite that either demonstrate how to code the pattern so that it would match, or overcome issues such as different file name scheme [for saved regexes], etc.
Once done, I will provide an ASCII image of the EBCDIC suite (standard EBCDIC output, my input files, and standard output from those files). EBCDIC images will be available on my distro, unless you want them with your test suite.
I would like to report that testinput1, testinput2 and their specialized EBCDIC derivatives work perfectly, except of three issues that are not clear to me:
a. As far as I can tell, 0xa0 is supposed to be a non-breaking space! Am I correct? If so, is there an obvious EBCDIC equivalent?b. Could you please guide me to any documentation about class in the from [!xxx!], so I can adjust my tests.c. What is 0x85 used for? Ze'ev Atlas