Re: [xml] Where can I find more examples about HTML-parsing?
- From: Wei Chen <chenwei624 yahoo com>
- To: veillard redhat com
- Cc: xml gnome org
- Subject: Re: [xml] Where can I find more examples about HTML-parsing?
- Date: Thu, 18 Dec 2003 11:44:05 -0800 (PST)
Hi Daniel,
The file "xmllint.c" does less than "testHTML.c",
since it simply parses the HTML file. "testHTML.c",
by contrast, can print every tag, text node, and
attribute in detail, as shown below:
...
...
ATTRIBUTE width
TEXT
content=604
ATTRIBUTE frameborder
TEXT
content=0
TEXT
content=
TEXT
content=
TEXT
content=
COMMENT
content= Navigations Spalte wird
ELEMENT tr
ELEMENT td
ATTRIBUTE id
TEXT
content=navigation
content=
...
...
That is certainly useful. While debugging the program I
found that it uses debug functions such as
xmlDebugDumpOneNode()
xmlDebugDumpNode()
xmlDebugDumpNodeList()
etc.
But what I want is somewhat different. I don't want
to print the output but save it to a buffer. Are there
any other functions that do the same? I mean, I want to
get the returned value as a string (char* or char[]).
Besides, is there any description of the HTML tree
structure? I find searching an HTML tree quite
complicated if one doesn't know the tree structure.
Regards
Wei Chen
--- Daniel Veillard <veillard redhat com> wrote:
On Wed, Dec 17, 2003 at 11:26:09PM -0800, Wei Chen wrote:
After I've downloaded the libxml2 package I just find
one example about HTML-parsing "testHTML.c". Where can
I find more?
in xmllint.c the code handling --html
Daniel
--
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/