Re: [xml] HTML Parser problems with chunk parser if HTML keywordsoverlap chunk border



On Thu, Jun 22, 2006 at 12:46:29PM +0200, Cyrill Osterwalder wrote:

I think delaying calling the parser if "</" is present in 
the last 8 character would be somewhat broken. 
You could perfectly find a number of
other elements after the script/style block (actually I would 
expect that)
and those need to be closed.

I see your point. However, I'm not sure that it wouldn't work. If we
wait until we have a chunk that does not have "</" in the trailing 8
characters and we call htmlParseScript() at that point, it should be
guaranteed that htmlParseScript() either reaches its breaking condition
or just consumes normal CDATA. If there are other elements after the
script/style block, they will be parsed correctly once htmlParseScript()
breaks, wouldn't they?

  Hum, okay after thinking a bit more about it, yes.

Daniel

-- 
Daniel Veillard      | Red Hat http://redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]