[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]

Re: [xml] SAX parser cannot handle UTF-8 characters below 0x20 except tab a nd linefeed



On Thu, Oct 24, 2002 at 04:42:56PM +0200, Steinborn Thomas wrote:
> Hello list,
> 
> the sax parser interface terminates with an internal error when parsing the
> attached document.
> 
> As far as I understand character codes below 0x20 should be all valid UTF-8
> character.

  Read the XML spec the set of character allowed is defined there:
    http://www.w3.org/TR/REC-xml#charsets

  the fact that a character can be encoded with Unicode doesn't
mean it's suitable for embedding in XML. 

Daniel

-- 
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]