Re: [xml] Where can I find more examples about HTML-parsing?



Hi Daniel,

The file "xmllint.c" does less as the "testHTML.c"
since it just simply parse the HTML-file. What the
"testHTML.c" does, it can print out every tag and the
text, attribute in detail as shown as follow:
...
...
            ATTRIBUTE width
              TEXT
                content=604
            ATTRIBUTE frameborder
              TEXT
                content=0
          TEXT
            content=
        TEXT
          content=
      TEXT
        content=
    COMMENT
      content= Navigations Spalte wird
    ELEMENT tr
      ELEMENT td
        ATTRIBUTE id
          TEXT
            content=navigation
         content=
...
...
Surely it's good. And as I debug the program I find it
uses the debug functions such as 
xmlDebugDumpOneNode()
xmlDebugDumpNode()
xmlDebugDumpNodeList()
etc.

But What I want ist somewhat different. I don't want
to print out but save to a buffer. Are there any other
functions which do the same? I mean I want to get the
retured valure as a string (char* or char[]).

Besides, is there any HTML-tree structure description?
I find searching an HTML-tree is quite complicated if
one doesn't know the tree structure.

Regards

Wei Chen




--- Daniel Veillard <veillard redhat com> wrote:
On Wed, Dec 17, 2003 at 11:26:09PM -0800, Wei Chen
wrote:
After I've downloaded the libxml2 package I just
find
one example about HTML-parsing "testHTML.c". Where
can
I find more?

  in xmllint.c the code handling --html

Daniel

-- 
Daniel Veillard      | Red Hat Network
https://rhn.redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit
 http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine
http://rpmfind.net/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
xml gnome org
http://mail.gnome.org/mailman/listinfo/xml


__________________________________
Do you Yahoo!?
New Yahoo! Photos - easier uploading and sharing.
http://photos.yahoo.com/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]