Re: [pcre-dev] issues with EBCDIC and pcretest

Top Page
Delete this message
Author: Ze'ev Atlas
Date:  
To: pcre-dev@exim.org
Subject: Re: [pcre-dev] issues with EBCDIC and pcretest
Hi PhilipThe \p and \P within a class seem to be resolved
Regarding [a-zA-Z] as special cases.  I will begin to work on PCRE2 sometime soon (after 10.20 final is out).  If you want me to take a look and suggest a patch, I am willing to do so.  At least I could try, but it would take time.
Please see the attached file with two remaining issues that I can see.
#1 \x41 seems not to be there
#2 The below is still an issue> /[\h]/BZ                                                          
------------------------------------------------------------------
        Bra                                                       
        [\x05\x0b-\x0d\x15\x25 ]                                  
        Ket                                                       
        End                                                      
------------------------------------------------------------------


Ze'ev Atlas




/-- This set of tests is run only with the 8-bit library. They do not require
    UTF-8 or Unicode property support. The file starts with all the tests of
    the POSIX interface, because that is supported only with the 8-bit library.
    --/

< forbid 8W

/\h/SI
Capturing subpattern count = 0
No options
No first char
No need char
Subject length lower bound = 1
Starting chars: \x05 \x40

/\v/SI
Capturing subpattern count = 0
No options
No first char
No need char
Subject length lower bound = 1
Starting chars: \x0b \x0c \x0d \x15 \x25

/\R/SI
Capturing subpattern count = 0
No options
No first char
No need char
Subject length lower bound = 1
Starting chars: \x0b \x0c \x0d \x15 \x25

/[\h]/BZ
------------------------------------------------------------------
        Bra
        [\x00\x05\x0b-\x0d\x15\x25  ]
        Ket
        End
------------------------------------------------------------------
    >\x05<
 0: \x05

/[\H]/BZ
------------------------------------------------------------------
        Bra
        [\x00-\xff]
        Ket
        End
------------------------------------------------------------------

/[¬\h]/BZ
------------------------------------------------------------------
        Bra
        [\x01-\x04\x06-\x0a\x0e-\x14\x16-\x24\x26-\x3fâ-\xff] (neg)
        Ket
        End
------------------------------------------------------------------

/[\v]/BZ
------------------------------------------------------------------
        Bra
        [\x0b-\x0d\x15\x25]
        Ket
        End
------------------------------------------------------------------

/[\V]/BZ
------------------------------------------------------------------
        Bra
        [\x00-\x0a\x0e-\x14\x16-\x24\x26-\xff]
        Ket
        End
------------------------------------------------------------------

/[\x0a\V]/BZ
------------------------------------------------------------------
        Bra
        [\x00-\x0a\x0e-\x14\x16-\x24\x26-\xff]
        Ket
        End
------------------------------------------------------------------

/[\R]/BZ
------------------------------------------------------------------
        Bra
        R
        Ket
        End
------------------------------------------------------------------

/[\s]/BZ
------------------------------------------------------------------
        Bra
        [\x05\x0b-\x0d\x15 ]
        Ket
        End
------------------------------------------------------------------

/[[:space:]]/BZ
------------------------------------------------------------------
        Bra
        [\x05\x0b-\x0d\x15 ]
        Ket
        End
------------------------------------------------------------------

/[\w]/BZ
------------------------------------------------------------------
        Bra
        [_a-ij-rs-zA-IJ-RS-Z0-9]
        Ket
        End
------------------------------------------------------------------

/[[:alpha:]]/BZ
------------------------------------------------------------------
        Bra
        [_a-ij-rs-zA-IJ-RS-Z]
        Ket
        End
------------------------------------------------------------------

/[a-z]/BZ
------------------------------------------------------------------
        Bra
        [a-z]
        Ket
        End
------------------------------------------------------------------

/[A-Z]/BZ
------------------------------------------------------------------
        Bra
        [A-Z]
        Ket
        End
------------------------------------------------------------------

/[a-zA-Z]/BZ
------------------------------------------------------------------
        Bra
        [a-zA-Z]
        Ket
        End
------------------------------------------------------------------

/[a-Z]/BZ
------------------------------------------------------------------
        Bra
        [a-Z]
        Ket
        End
------------------------------------------------------------------

/[[!xxx!]]/BZ
------------------------------------------------------------------
        Bra
        [!x[]
        ]
        Ket
        End
------------------------------------------------------------------