Re: [xml] UTF-8 decoding bug in HTML parser

On Wed, Oct 01, 2008 at 11:09:27AM +1000, Michael Day wrote:
Hi Daniel,

  Reusing the XML code for this seems to work fine for me and the
regression test, but you probably have a more extensive HTML test
suite than me ;-) so raise the problem if there is a regression!
Will commit to SVN with the test case,
Thanks, I'll check it out. I think this greatly helps the usability
of libxml2 for parsing HTML documents.

  The patch doesn't work for the push parser though, and if I add it to
push a lot of things break, so it's not final ...

Is it possible to fix this in a way similar to the XML parser? While we  
don't use the push parser ourselves, it would be great if the patch  
could be merged, as I think it would help a lot of people.

  Okay, I didn't have time to fix the push HTML parser too, but the
fix for htmlParseDocument is committed now.


Daniel Veillard      | libxml Gnome XML XSLT toolkit
daniel veillard com  | Rpmfind RPM search engine | virtualization library
