Re: [xml] SAX parser cannot handle UTF-8 characters below 0x20 except tab a nd linefeed



On Thu, Oct 24, 2002 at 04:42:56PM +0200, Steinborn Thomas wrote:
Hello list,

the sax parser interface terminates with an internal error when parsing the
attached document.

As far as I understand character codes below 0x20 should be all valid UTF-8
character.

  Read the XML spec the set of character allowed is defined there:
    http://www.w3.org/TR/REC-xml#charsets

  the fact that a character can be encoded with Unicode doesn't
mean it's suitable for embedding in XML. 

Daniel

-- 
Daniel Veillard      | Red Hat Network https://rhn.redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]