RE: [xml] HTMLparser: body/noscript mismatch?
- From: "Cyrill Osterwalder" <cyrill osterwalder tonsil ch>
- To: <veillard redhat com>, "'Cyrill Osterwalder'" <cyrill osterwalder tonsil ch>
- Cc: xml gnome org
- Subject: RE: [xml] HTMLparser: body/noscript mismatch?
- Date: Fri, 21 Jan 2005 18:35:02 +0100
Thanks for your response!
Yes, that seems to be the case. I'm not particularly fond of <noscript>
either but there are some applications using it. If the <body> tag is
implied by the parser it makes the page impossible to render.
I'll try to check the source to add the noscript to the html level. As I
understand the following link:
http://www.w3.org/TR/REC-html40/sgml/dtd.html#block
the <noscript> block can be a child of <html>.
Cyrill
-----Original Message-----
From: Daniel Veillard [mailto:veillard redhat com]
Sent: Freitag, 21. Januar 2005 18:07
To: Cyrill Osterwalder
Cc: xml gnome org
Subject: Re: [xml] HTMLparser: body/noscript mismatch?
On Fri, Jan 21, 2005 at 05:30:04PM +0100, Cyrill Osterwalder wrote:
Hi there!
I'm stuck with a <body>/<noscript> HTMLparser mismatch and
I don't know if
I'm seeing things wrong. It would be great if anybody could
clarify. It
seems that the body/noscript blocks are parsed in the wrong order.
I put the following HTML code into the libxml2-2.6.16 HTMLparser:
<html>
<head>
<meta>http-equiv="Content-Type" content="text/html; charset=utf-8">
<title>TEST</title>
</head>
<noscript>
<body text="#000000">
TEST-NOSCRIPT
</body>
</noscript>
<body>
anything else
</body>
</html>
Using the SAX interface for the HTMLparser I get the
following calls:
start html
start head
start meta
end meta
start title
end title
end head
start body (<---- expecting noscript here)
start noscript (<---- expecting body here)
end noscript
end body
start body
start p
end p
end body
end html
The <body> and <noscript> blocks are confused and the
result is not very
much appreciated by browsers. Am I missing something on my
side or is this
unwanted behaviour of the HTMLparser?
The HTML parser seems to consider that only head and body
are allowed
as child of html, and so it opens a body when seeing the noscript,
Any hints would be helpful, thanks.
First time I hear about noscript...
Daniel
--
Daniel Veillard | Red Hat Desktop team http://redhat.com/
veillard redhat com | libxml GNOME XML XSLT toolkit
http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]