[xml] xmllint --html --xmlout

From: Elliotte Harold <elharo metalab unc edu>
To: xml gnome org
Subject: [xml] xmllint --html --xmlout
Date: Mon, 12 Feb 2007 07:42:13 -0500

How robust is

xmllint --html --xmlout

Is it possible to confuse it so badly it won't continue or will generateill-formed markup? Or will it keep on trucking no matter what?

How does the HTML parser handle bogons (unrecognized elements)? Are theytreated as empty or dropped or something else?


How good an alternative is this for TagSoup and Tidy?

I'm working on a book about converting messy old HTML to clean XHTML,and I'm trying to decide exactly how much of each tool to recommend when.


--
ïElliotte Rusty Harold  elharo metalab unc edu
Java I/O 2nd Edition Just Published!
http://www.cafeaulait.org/books/javaio2/
http://www.amazon.com/exec/obidos/ISBN=0596527500/ref=nosim/cafeaulaitA/

Follow-Ups:
- Re: [xml] xmllint --html --xmlout
  - From: Daniel Veillard

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]