Re: [xml] Problem with encoding in libxml.



On Tue, Dec 13, 2005 at 03:55:11PM +0530, Arun S K (RBIN/EDM3) * wrote:
<?xml version="1.0" encoding="UTF8"?>

The document has the character ß (Beeta) in it. The parser aborts with the following message 
--------------------------------------------------------------------
:13: parser error : Input is not proper UTF-8, indicate encoding !
Bytes: 0x80 0x20 0x3C 0x2F
                              <NAME>test_1ß</NAME>
--------------------------------------------------------------------

Is ß not a valid UTF8 character?

  The character is part of unicode. But the sequence of bytes used to
express it are not valid in UTF-8. It is a fatal XML error.

How can this be corrected.

  Replacing the wrong bytes in the instance by a sequence which is
valid for UTF-8.
   Read the material pointed to at the beginning of
     http://xmlsoft.org/encoding.html

Daniel

-- 
Daniel Veillard      | Red Hat http://redhat.com/
veillard redhat com  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/



[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]