[Date Prev][Date Next] [Thread Prev][Thread Next]
[Thread Index]
[Date Index]
[Author Index]
Re: [xml] Parsing tag-soup HTML
- From: Stefan Behnel <stefan_ml behnel de>
- To: Nick Kew <nick webthing com>
- Cc: xml gnome org
- Subject: Re: [xml] Parsing tag-soup HTML
- Date: Mon, 18 Jun 2007 15:08:53 +0200
Nick Kew wrote:
> On Mon, 18 Jun 2007 08:14:01 -0400
> Try running the following through "xmllint --html":
>
> <meta http-equiv="content-type" content="text/html;charset=ascii" />
> <html lang="en">
> <head><title>foo</title></head>
> <body><h1>Hello, World</h1></body>
> </html>
In that case I would actually prefer making it a general special case rule in
the current parser to interpret a leading <meta> tag as an encoding hint to
the parser. That would add quite a portion of real-world non-HTML to the set
of parsable (i.e. fixable) documents.
Stefan
[Date Prev][Date Next] [Thread Prev][Thread Next]
[Thread Index]
[Date Index]
[Author Index]