On Tuesday 14 April 2015 17:50:51 you wrote:
If anything like this does get put in, it should only be if it is a configurable option that is disabled by default - an xml parser should only accept a strictly-conforming document by default. Adding support for ‘broken’ html because other (weak) parsers allow it is not a good plan as it causes divergence from the standard.
There you go; you find the updated patch attached. It now requires HTML_PARSE_RECOVER option to be set for recovering from stand-alone less-than characters. Best regards, Christian Schoenebeck
Attachment:
libxml2-less-than-char.patch
Description: Text Data