Re: [xml] caret in regexp character class



On Tue, Mar 7, 2017 at 2:48 PM, Per Hedeland  wrote:
Hello,

With libxml2-2.9.4, the regular expression [ab^cd] is equivalent to
[^cd], i.e. it matches all characters except 'c' and 'd'. However from
my reading of
https://www.w3.org/TR/2001/REC-xmlschema-2-20010502/#regexs, (1) the
caret has no special meaning unless it is the first character of the
charGroup in a charClassExpr, i.e. the above regexp should match the
characters 'a', 'b', '^', 'c', and 'd'.

...

I.e. I believe the current behavior is a bug, and that the correct
behavior is (1).

I believe you are correct.

Csaba
-- 
GCS a+ e++ d- C++ ULS$ L+$ !E- W++ P+++$ w++$ tv+ b++ DI D++ 5++
The Tao of math: The numbers you can count are not the real numbers.
Life is complex, with real and imaginary parts.
"Ok, it boots. Which means it must be bug-free and perfect. " -- Linus Torvalds
"People disagree with me. I just ignore them." -- Linus Torvalds


[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]