Re: [xml] Parsing tag-soup HTML





Nick Kew wrote:
On Mon, 18 Jun 2007 08:14:01 -0400
Try running the following through "xmllint --html":

<meta http-equiv="content-type" content="text/html;charset=ascii" />
<html lang="en">
<head><title>foo</title></head>
<body><h1>Hello, World</h1></body>
</html>

In that case I would actually prefer adding a general special-case rule to
the current parser: interpret a leading <meta> tag as an encoding hint. That
would add a sizable portion of real-world non-HTML (tag soup) to the set of
parsable (i.e. fixable) documents.
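
To make the idea concrete, here is a rough sketch of such a rule in Python. It is not libxml2 code and not how the parser currently behaves; the helper name leading_meta_charset and both regular expressions are made up for illustration. It only looks at a <meta> tag that opens the document, as in the sample above.

```python
import re
from typing import Optional

# Hypothetical patterns (not taken from libxml2).
# Match a <meta ...> tag only at the very start of the input,
# allowing leading whitespace.
_LEADING_META = re.compile(rb"\s*<meta\b([^>]*)>", re.IGNORECASE)
# Pull the charset name out of the tag's attributes.
_CHARSET = re.compile(rb"charset\s*=\s*[\"']?\s*([A-Za-z0-9._:-]+)", re.IGNORECASE)


def leading_meta_charset(data: bytes) -> Optional[str]:
    """Return the charset named by a leading <meta> tag, or None."""
    meta = _LEADING_META.match(data)
    if meta is None:
        return None  # document does not start with <meta>
    charset = _CHARSET.search(meta.group(1))
    if charset is None:
        return None  # leading <meta> carries no charset
    return charset.group(1).decode("ascii")


sample = (
    b'<meta http-equiv="content-type" content="text/html;charset=ascii" />\n'
    b'<html lang="en">\n'
    b"<head><title>foo</title></head>\n"
    b"<body><h1>Hello, World</h1></body>\n"
    b"</html>\n"
)

hint = leading_meta_charset(sample)
```

For the sample document the hint comes out as "ascii", and a parser could then use it as the input encoding before building the tree. A <meta> that appears in its proper place inside <head> is deliberately ignored here, since that case is already covered by the normal handling.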

Stefan



