Re: [xml] Cleaning the Web - Implementing HTML 5 parsing in libxml2

From: Michael Day <mikeday yeslogic com>
To: veillard redhat com
Cc: xml gnome org, "Michael(tm) Smith" <mike w3 org>, Dan Connolly <connolly w3 org>, Chris Wilson <Chris Wilson microsoft com>
Subject: Re: [xml] Cleaning the Web - Implementing HTML 5 parsing in libxml2
Date: Sat, 09 Aug 2008 18:56:35 +1000

Hi Daniel,

  I know that some people like Michael Day rely heavily on the HTML
parser behaviour, and would very much like to hear from them too, as
the change would have more impact on them than me. I know the Webkit
project uses libxml2 but only for parsing XML, and wonder if an HTML5
compliant parser in libxml2 might change this or not. Even if this
wasn't the case I would like to see HTML5 support in, but being able
to assess possible impact like this would be nice.

We do use the libxml2 HTML parser in our Prince formatter, and it works
very nicely. It does have limitations when it comes to "tag soup" HTML
and browser compatibility, and ideally these could be solved by
following the HTML5 specification, without affecting the current
behaviour for valid documents.


In summary: it would be great if libxml2 was also an HTML5 parser!

Is anyone available to implement it? :)

Best regards,

Michael

--
Print XML with Prince!
http://www.princexml.com

References:
- [xml] Cleaning the Web - Implementing HTML 5 parsing in libxml2
  - From: Karl Dubost
- Re: [xml] Cleaning the Web - Implementing HTML 5 parsing in libxml2
  - From: Daniel Veillard
