Re: [xml] libxml2 2.7.1 breaks XML serialisation of HTML trees



On Wed, Sep 10, 2008 at 08:08:53AM +0200, Stefan Behnel wrote:
Hi,

Martin (gzlist) wrote:
On 08/09/2008, Stefan Behnel <stefan_ml behnel de> wrote:
 there was a change in 2.7.1 (xmlsave.c, ~760) that prevents HTML documents
 from being serialised in XML style...
 ...
 If the current behaviour is wanted, what's the future way of achieving
 this *without* temporarily modifying the document? (i.e. without breaking
 thread concurrency)

I have been eyeing the other 28 bits of xmlSaveOption recently, mostly
to add a XML_SAVE_XHTML to go counter to the current XML_SAVE_NO_XHTML
that would unconditionally turn *on* the Appendix C rules without
needing one of the XHTML 1.0 doctypes.

  Stephan, Martin,

could you check the enclosed patch ? I'm commiting it to SVN head too
but it's probably easier to review that way.

Basically it adds 3 parsing options, and for the old entry points
xmlDump* not xmlSave based it forces the XML_SAVE_AS_XML bypassing
the doc type in case of HTML documents. that should fix Stephan problem
and also provide ways to do things with xmlSave when available.
For the 'problem' of the added meta an XML_SAVE_IMMUTABLE option could
be added that sounds more generic, but i'm not adding this in the patch
to not complicate things.

I hope i didn't miss any old entry point which behaviour was modified in
2.7.1, and not missing places where the new flags should be checked too,

Daniel

-- 
Daniel Veillard      | libxml Gnome XML XSLT toolkit  http://xmlsoft.org/
daniel veillard com  | Rpmfind RPM search engine http://rpmfind.net/
http://veillard.com/ | virtualization library  http://libvirt.org/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]