[xml] Re: =?iso-8859-1?Q?=5Bxml=5D_What_is_the_difference_between_#xE9_and_=E9_=28?= =?iso-8859-1?Q?e_acute=29?=

On Thu, Jul 26, 2001 at 01:54:52PM +0200, paul gounot wrote:

I'm newbie in xml encoding and there is something I do not understand. I
would be very obliged if someone could answer me, please.

  In general asking newbies question to a specialized mailing list
where you are not subscribed to is not considered good behaviour !
The fact that it is a real FAQ and that is covered in any beginner's
book to XML is not good.

I would encode some wml page. Some of them have é and other have #xE9
instead of é.
Libxml2.4.0 encode those with #xE9 and doesn't encode those with é:

Entity: line 12: error: Input is not proper UTF-8, indicate encoding !
 <p><a href="cesoir.php">A la télé ce soir</a> </p>
Entity: line 12: error: Bytes: 0xE9 0x6C 0xE9 0x20
 <p><a href="cesoir.php">A la télé ce soir</a> </p>

But the sequence for "élé " is 0xE9 0x6C 0xE9 0x20, so the value for

How does libxml2.4.0 make the distinction in wml page between #xE9 and
é? What need to be done to make é character accepted?

  Read the eror message !!!
    "indicate the encoding !"
  Read the doc too !

 If you have other questions:

   1/ read the docs
   2/ subscribe to the list
   3/ get a book on XML and try to understand the underlying standard

 before posting or using it in a broken way.

 WML has been know for breaking the XML standard by not declaring the
encoding, you will get no sympathy around the XML community in general
if you don't understand why this specific example is broken and if you
don't fix your application.


Daniel Veillard      | Red Hat Network http://redhat.com/products/network/
veillard redhat com  | libxml Gnome XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/
Sep 17-18 2001 Brussels Red Hat TechWorld http://www.redhat-techworld.com

[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]