Re: [xml] less-than character and HTML parser module

On 13 Apr 2015, at 22:43, Christian Schoenebeck <schoenebeck crudebyte com> wrote:


I just encountered an issue with stand-alone less-than characters if the 
document is parsed by libxml2's HTML parser module. Consider you have a text 
in your HTML document like:

      a < b

The less-than sign in this case is interpreted by the HTML parser module as 
tag start, causing subsequent text (in this case "< b") to be dropped.

Isn't that correct? Shouldn't your document have

     a &lt; b

in instead?

Alex Bligh

[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]