Re: [xml] UTF-8 decoding bug in HTML parser
- From: Daniel Veillard <veillard redhat com>
- To: Michael Day <mikeday yeslogic com>
- Cc: xml gnome org
- Subject: Re: [xml] UTF-8 decoding bug in HTML parser
- Date: Fri, 3 Oct 2008 10:08:55 +0200
On Wed, Oct 01, 2008 at 11:09:27AM +1000, Michael Day wrote:
Reusing the XML code for this seems to work fine for me and the
regression test, but you probably have a more extensive HTML test
suite than me ;-) so raise the problem if there is a regression!
Will commit to SVN with the test case,
Thanks, I'll check it out. I think this greatly helps the usability
of libxml2 for parsing HTML documents.
The patch doesn't work for the push parser, though, and if I add it to
the push parser a lot of things break, so it's not final ...
Is it possible to fix this in a way similar to the XML parser? While we
don't use the push parser ourselves, it would be great if the patch
could be merged, as I think it would help a lot of people.
Okay, I haven't had time to fix the push HTML parser too, but the
fix for htmlParseDocument is committed now.
Daniel Veillard | libxml Gnome XML XSLT toolkit http://xmlsoft.org/
daniel veillard com | Rpmfind RPM search engine http://rpmfind.net/
http://veillard.com/ | virtualization library http://libvirt.org/