Re: [xml] xmlReadFile with filename instead of URL?

From: Michael Day <mikeday yeslogic com>
To: xml gnome org
Cc: veillard redhat com
Subject: Re: [xml] xmlReadFile with filename instead of URL?
Date: Fri, 30 Mar 2007 09:43:43 +1000

The behaviour only seems to trigger when you configure --without-zlib. Idon't know why yet, but there are zlib specific #ifdefs in the loadingand URL mangling code, so there could be something funny going on thatisn't triggered when zlib is disabled.


Okay, I found the bug, it's very simple.

Here is the comment for xmlFileOpen:

 * Wrapper around xmlFileOpen_real that try it with an unescaped
 * version of @filename, if this fails fallback to @filename

However, the code does *not* do this:

    unescaped = xmlURIUnescapeString(filename, 0, NULL);
    if (unescaped != NULL) {
        retval = xmlFileOpen_real(unescaped);
        xmlFree(unescaped);
    } else {
        retval = xmlFileOpen_real(filename);
    }
    return retval;

The code is unescaping the filename first and trying to load that. If itfails, then it fails. Shouldn't it try and load the filename as-isfirst, and if *that* fails try unescaping it? Or better yet, not tryunescaping it all, I mean since when did filenames use % escapes anyway?


So I suggest this patch to xmlFileOpen in xmlIO.c:

    retval = xmlFileOpen_real(filename);
    if (retval == NULL) {
        unescaped = xmlURIUnescapeString(filename, 0, NULL);
        if (unescaped != NULL) {
            retval = xmlFileOpen_real(unescaped);
            xmlFree(unescaped);
        }
    }
    return retval;

With this code the file "hello%2Fworld.xml" will be loaded first, andonly if it is not found will "hello/world.xml" be loaded. But yeah, Iwould rather delete that entire if test, as it seems to me that any URLunescaping should be handled a lot earlier before xmlFileOpen sees it.


Michael

--
Print XML with Prince!
http://www.princexml.com

Follow-Ups:
- Re: [xml] xmlReadFile with filename instead of URL?
  - From: Daniel Veillard

References:
- [xml] xmlReadFile with filename instead of URL?
  - From: Michael Day
- Re: [xml] xmlReadFile with filename instead of URL?
  - From: Daniel Veillard
- Re: [xml] xmlReadFile with filename instead of URL?
  - From: Michael Day

[Date Prev][Date Next] [Thread Prev][Thread Next] [Thread Index] [Date Index] [Author Index]