Re: [xml] problem with gzip decoding in nanohttp

From: "Liron" <magilam netvision net il>
To: <veillard redhat com>
Cc: xml gnome org
Subject: Re: [xml] problem with gzip decoding in nanohttp
Date: Thu, 26 Jan 2006 19:07:02 +0100

Hi Daniel,

You're absolutely right about the "don't fix it if it ain't broken" approachbut after reviewing the code I think that it's badly broken and I'llexplain:It's true that those functions do different things but the common thing forall of them is reading from a socket. As I explained, the bug I found willoccur in every function that reads content (as opposed to headers) from thesocket and not using "inflate" when the content is gzip encoded. The onlyfunction that decompresses the content is xmlNanoHTTPRead BUT it's not theonly function that delivers content.xmlNanoHTTPFetch is a function that opens a URL and saves the content tofile, if you're telling me that this file should contain encoded data incase that gzip is used then we don't have a problem (I think it's wrong butif it's by design...). But if you do expect the output file to contain htmlfor example then it won't work.

Like I said, I didn't review the code enough to make the changes (or getpermission to do so yet ;)) but it looks like xmlNanoHTTPRecv and ONLYxmlNanoHTTPRecv should be changed to support the decompression since everyfunction that reads from the socket calls it (but not every function thatcalls it decompresses the content)

If you want to test it, simple use xmlNanoHTTPFetch on any site thatsupports gzip and see what I mean.

Another issue I wanted to ask you about is the content-length when thecontent is compressed. If content-length header exists it'll return thelength of the compressed data so I applied a new function that simplyreturns whether the content is gziped or not. This is good for applicationsthat depends on the content-length header for buffer allocations so theyshould be aware if this field contains the right number or not (Just ahelper function).


I hope I made myself clear here, please let me know what you think
Liron

----- Original Message -----From: "Daniel Veillard" <veillard redhat com>

To: "Liron" <magilam netvision net il>
Cc: <xml gnome org>
Sent: Thursday, January 26, 2006 6:09 PM
Subject: Re: [xml] problem with gzip decoding in nanohttp

 I still don't understand your problem w.r.t. the code.
All those function do different things as their doc explains, some read
at the low level, some are line read public API, some are block read API
and the last one read and save to a file. The functions are different
because they implement different interface, to me that's fine, they
are actually layered.
 So explain the bug(s), explain why you think there is duplication,
I can't see anything obvious even in the light of your last message.
And if it's only 'cleanup' sometimes the 'don't fix it if it ain't
broken' approach to maintainance is just fine, but I'm listening.

Daniel

--
Daniel Veillard      | Red Hat http://redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/

http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/

References:
- Re: [xml] problem with gzip decoding in nanohttp
  - From: Liron
- Re: [xml] problem with gzip decoding in nanohttp
  - From: Daniel Veillard

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]