Re: [xml] Patch to improve HTMLparser's robustness
- From: Liam R E Quin <liam holoweb net>
- To: Arnold Hendriks <a hendriks b-lex nl>
- Cc: xml gnome org, veillard redhat com
- Subject: Re: [xml] Patch to improve HTMLparser's robustness
- Date: Tue, 22 Apr 2008 12:39:28 -0400
On Tue, 2008-04-22 at 15:56 +0200, Arnold Hendriks wrote:
Can I cheat? :) Given the fact that nothing should appear between
</body> and </html>, and </html> is always the last tag, its' easiest to
just ignore them and let the autoclose deal with it...
In practice I expect it's not uncommon to find text after </html> --
e.g. a site like geocities that appends ads to a user's page.
Liam
--
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org www.advogato.org
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]