Re: [xml] Patch to improve HTMLparser's robustness



On Tue, 2008-04-22 at 15:56 +0200, Arnold Hendriks wrote:
 
Can I cheat? :) Given the fact that nothing should appear between 
</body> and </html>, and </html> is always the last tag, its' easiest to 
just ignore them and let the autoclose deal with it...

In practice I expect it's not uncommon to find text after </html> --
e.g. a site like geocities that appends ads to a user's page.

Liam

-- 
Liam Quin - XML Activity Lead, W3C, http://www.w3.org/People/Quin/
Pictures from old books: http://fromoldbooks.org/
Ankh: irc.sorcery.net irc.gnome.org www.advogato.org




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]