[xml] What is the difference between #xE9 and =?iso-8859-1?Q?=E9?= (e acute)



Hello,

I'm newbie in xml encoding and there is something I do not understand. I
would be very obliged if someone could answer me, please.

I would encode some wml page. Some of them have é and other have #xE9
instead of é.
Libxml2.4.0 encode those with #xE9 and doesn't encode those with é:

Entity: line 12: error: Input is not proper UTF-8, indicate encoding !
 <p><a href="cesoir.php">A la télé ce soir</a> </p>
                               ^
Entity: line 12: error: Bytes: 0xE9 0x6C 0xE9 0x20
 <p><a href="cesoir.php">A la télé ce soir</a> </p>

But the sequence for "élé " is 0xE9 0x6C 0xE9 0x20, so the value for
é=0xE9.

How does libxml2.4.0 make the distinction in wml page between #xE9 and
é? What need to be done to make é character accepted?

Regards,

Paul.




[Date Prev][Date Next]   [Thread Prev][Thread Next]   [Thread Index] [Date Index] [Author Index]