[Date Prev][Date Next] [Thread Prev][Thread Next]
[Thread Index]
[Date Index]
[Author Index]
Re: [xml] Crash when parsing bad HTML
- From: Daniel Veillard <veillard redhat com>
- To: Pierre Belzile <pierre belzile idilia com>
- Cc: xml gnome org
- Subject: Re: [xml] Crash when parsing bad HTML
- Date: Thu, 23 Aug 2007 03:58:40 -0400
On Wed, Aug 22, 2007 at 07:16:51PM -0400, Pierre Belzile wrote:
>
> Hi,
> I'm using the HTML parser (htmlParseDocument) and my application
> segfaults when processing this document:
> <HTML>
> <PRE>
> Some text
> <?PRE>
> <HTML>
> The gdb traceback is:
> #0 0x0000002a96e3a4e4 in xmlSAX2ProcessingInstruction () from
> /usr/lib64/libxml2.so.2
> #1 0x0000002a96db9779 in htmlParseEntityRef () from
> /usr/lib64/libxml2.so.2
> #2 0x0000002a96dbb69a in htmlParseElement () from
> /usr/lib64/libxml2.so.2
> # ...
> Of course, it's bad HTML but an exception would be more acceptable. I
> tried using the latest official version (2.6.29) and the problem
> occurs there too. Is there a patch somewhere? If not, a hint would be
> appreciated because I'm going to have to fix it.
> Cheers, Pierre
I can't reproduce this. You MUST provide the full input document as an
attachment, I can't reproduce this with xmllint --html . If you do that
right now and I can reproduce this this will be fixed immediately as I think
I will do a release today. But with the current data you provided there is
nothing more I can do:
paphio:~/XML -> valgrind xmllint --html tst.html
tst.html:5: HTML parser error : htmlParseStartTag: misplaced <html> tag
<HTML>
^
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
<html><body><pre>
Some text
<?PRE></pre></body></html>
paphio:~/XML -> rpm -q libxml2
libxml2-2.6.29-1
paphio:~/XML -> valgrind /usr/bin/xmllint --html tst.html
tst.html:5: HTML parser error : htmlParseStartTag: misplaced <html> tag
<HTML>
^
<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" "http://www.w3.org/TR/REC-html40/loose.dtd">
<html><body><pre>
Some text
<?PRE></pre></body></html>
paphio:~/XML ->
At this point I have to expect the bug to be somewhere else in your
application because I can't reproduce it really !
Daniel
--
Red Hat Virtualization group http://redhat.com/virtualization/
Daniel Veillard | virtualization library http://libvirt.org/
veillard redhat com | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
[Date Prev][Date Next] [Thread Prev][Thread Next]
[Thread Index]
[Date Index]
[Author Index]