Re: [xml] Patch for HTMLparser
- From: Daniel Veillard <veillard redhat com>
- To: James Bursa <bursa users sourceforge net>
- Cc: xml gnome org
- Subject: Re: [xml] Patch for HTMLparser
- Date: Thu, 20 Nov 2003 16:59:16 -0500
On Thu, Nov 20, 2003 at 09:40:23PM +0000, James Bursa wrote:
Below is a minor patch for HTMLparser.c:
1. Handle hex character entities like ģ, ie. a capital X.
Hum, I didnt know that form was possible, indeed
http://www.w3.org/TR/REC-html40/charset.html#h-5.3.1
I learnt something :-)
2. Skip to the end of misplaced <body> start tags. Currently any attributes
of a misplaced <body> are parsed as text and included as a <p> element in
the tree.
eek :-), sounds good too.
Applied and commited, thanks a lot !
Daniel
--
Daniel Veillard | Red Hat Network https://rhn.redhat.com/
veillard redhat com | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
[
Date Prev][
Date Next] [
Thread Prev][
Thread Next]
[
Thread Index]
[
Date Index]
[
Author Index]