Re: [xml] Problem with encoding in libxml.
- From: Daniel Veillard <veillard redhat com>
- To: "Arun S K (RBIN/EDM3) *" <Arun SK in bosch com>
- Cc: xml gnome org
- Subject: Re: [xml] Problem with encoding in libxml.
- Date: Tue, 13 Dec 2005 05:34:00 -0500
On Tue, Dec 13, 2005 at 03:55:11PM +0530, Arun S K (RBIN/EDM3) * wrote:
> <?xml version="1.0" encoding="UTF8"?>
> The document has the character ß (Beeta) in it. The parser aborts with
> the following message:
> --------------------------------------------------------------------
> :13: parser error : Input is not proper UTF-8, indicate encoding !
> Bytes: 0x80 0x20 0x3C 0x2F
> <NAME>test_1ß</NAME>
> --------------------------------------------------------------------
> Is ß not a valid UTF8 character?
The character is part of Unicode, but the sequence of bytes used to
express it is not valid UTF-8. This is a fatal XML error.
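
To make the distinction concrete, here is a minimal Python sketch (not
libxml code) comparing byte sequences for ß. It assumes, purely for
illustration, that the file was saved in a single-byte legacy encoding
such as ISO-8859-1; the message does not say which encoding was actually
used. The helper name is_valid_utf8 is hypothetical.

```python
# ß is the code point U+00DF. UTF-8 needs two bytes to express it.
sharp_s = "\u00df"
utf8_bytes = sharp_s.encode("utf-8")
latin1_bytes = sharp_s.encode("iso-8859-1")  # assumed legacy encoding


def is_valid_utf8(data: bytes) -> bool:
    """Return True if data decodes cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


print(utf8_bytes.hex(" "))    # c3 9f
print(latin1_bytes.hex(" "))  # df

# The same element content with each encoding of ß.
print(is_valid_utf8(b"test_1" + utf8_bytes))    # True
print(is_valid_utf8(b"test_1" + latin1_bytes))  # False

# The bytes reported by the parser start with 0x80, which can only be a
# continuation byte in UTF-8 and therefore cannot start a character.
print(is_valid_utf8(b"\x80 </"))  # False
```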
> How can this be corrected?
By replacing the wrong bytes in the instance with a sequence that is
valid UTF-8.
Read the material pointed to at the beginning of
http://xmlsoft.org/encoding.html
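
One way to do that replacement, sketched in Python under the assumption
that the true source encoding of the file is known (ISO-8859-1 is used
here only as an example; the actual encoding must be determined for the
real document). The function name reencode_to_utf8 is hypothetical.

```python
def reencode_to_utf8(data: bytes, source_encoding: str) -> bytes:
    """Decode data using its real encoding, then re-encode it as UTF-8.

    source_encoding is an assumption supplied by the caller; if it is
    wrong, the result will be valid UTF-8 but may hold wrong characters.
    """
    return data.decode(source_encoding).encode("utf-8")


# Element content as it might appear in an ISO-8859-1 file (assumed).
legacy = b"<NAME>test_1\xdf</NAME>"
fixed = reencode_to_utf8(legacy, "iso-8859-1")

# The converted bytes now decode as UTF-8 and contain ß as 0xC3 0x9F.
print(fixed.decode("utf-8"))
print(fixed.hex(" "))
```

Once the bytes are converted, they match what the encoding declaration in
the XML prolog promises.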
Daniel
--
Daniel Veillard | Red Hat http://redhat.com/
veillard redhat com | libxml GNOME XML XSLT toolkit http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/