Re: [xml] LibXML Incorrectly Parses Tables by Omitting Implied TBODY



On Fri, Sep 23, 2011 at 9:05 AM, Glen Hein wrote:
> On Fri, 2011-09-23 at 08:44 +0200, Ralf Junker wrote:
>> On 23.09.2011 08:21, Alex Bligh wrote:
>>> libxml parses XML not HTML.
>>
>> Wrong. libxml parses XML _and_ HTML. Documented here:
>>
>>   http://www.xmlsoft.org/html/libxml-HTMLparser.html
>
> Yes, but libxml doesn't claim to be the world's best html parser:
>
> "should be able to parse "real world" HTML, even if severely broken from a
> specification point of view"

It's documented, therefore it can be called a feature, not a bug :)

Csaba
-- 
GCS a+ e++ d- C++ ULS$ L+$ !E- W++ P+++$ w++$ tv+ b++ DI D++ 5++
The Tao of math: The numbers you can count are not the real numbers.
Life is complex, with real and imaginary parts.
"Ok, it boots. Which means it must be bug-free and perfect. " -- Linus Torvalds
"People disagree with me. I just ignore them." -- Linus Torvalds


